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Two  studies  examined  the  effects  of  display  factors  on 
observers'  ability  to  use  multiple  sources  in  visual  signal 
detection.  The  information  provided  by  a  given  source  was 
represented  as  a  value  on  a  graphical  element.  Each  dis- 
played value  was  an  independent  sample  from  one  of  two 
normal  distributions,  depending  on  the  type  of  trial  (Signal 
or  Noise)  and  the  task  being  performed  (Yes/No  or  Four- 
Alternative-Forced-Choice,  4AFC) . 

The  first  study  examined  observers'  ability  to  use 
differences  in  source  reliability  in  performing  a  Yes/No 
decision  task.  The  reliability  of  the  different  display 
elements  was  controlled  by  manipulating  the  variance  of  the 
distributions  from  which  the  element  values  were  sampled 
(high  reliability  -  low  variance).  Observers'  efficiency  in 
weighting  the  sources  based  on  their  reliability  was 
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estimated.  Observers  were  relatively  inefficient  at  using 
reliability  information  in  forming  a  two-alternative  deci- 
sion (signal  or  noise) .  Only  when  a  luminance  cue  to 
source  reliability  was  introduced  at  stimulus  durations 
equal  to  or  greater  than  400  ms  was  observer  performance 
equivalent  to  an  equal  reliability  condition.  The  evidence 
suggests  that  luminance  cues  aid  observers  in  prioritizing 
visual  information  sources  according  to  their  importance  to 
the  task. 

The  second  study  examined  the  effects  of  display  element 
arrangement  on  observers'  performance  in  both  Yes/No  and 
4AFC  visual  signal  detection  tasks.  The  information  was 
displayed  graphically  in  one  of  six  formats  constructed 
from  a  combination  of  two  factors:  1)  whether  or  not  the 
display  elements  were  arranged  to  produce  a  global  feature 
that  resulted  from  the  interaction  of  the  separate  display 
elements,  an  "emergent  feature,"  and  2)  whether  or  not  the 
magnitude  of  this  global  feature  was  monotonically  related 
to  the  optimal  decision  statistic  (for  the  Yes/No  task). 
The  results  indicate  that  performance  was  facilitated  by  an 
emergent  feature  in  the  Yes/No  task  and  was  hindered  by  the 
presence  of  an  emergent  feature  in  the  4AFC  task.  Due  to 
the  relatively  high  performance  produced  by  an  angular 
element  code,  it  was  not  possible  to  determine  whether 
visual  signal  detection  was  affected  by  the  presence  of  a 
relationship  between  an  emergent  feature  and  the  optimal 
decision   statistic. 


MATHEMATICAL  MODELS 
Introduction 

Every  day,  humans  are  faced  with  uncertain  circumstances 
in  which  they  have  to  form  decisions  based  on  multiple 
sources  of  information.  In  some  situations  these  decisions 
have  to  be  made  rapidly,  possibly  to  avoid  an  unfortunate 
outcome.  For  example,  air  traffic  controllers  have  to 
detect  and  respond  to  selected  events  under  time  stress  in 
order  to  avoid  potential  aircraft  collisions.  In  many 
situations,  the  information  is  conveyed  to  the  decision 
maker  via  visual  displays.  As  a  result,  researchers  are 
interested  in  determining  how  efficiently  observers  can 
combine  spatially  and  temporally  presented  visual  informa- 
tion sources,  and  in  identifying  the  factors  which  influence 
overall  processing  efficiency. 

The  current  investigation  examines  observers'  use  of 
multiple,  spatially  presented,  independent,  visual  informa- 
tion sources  in  forming  detection  decisions.  Using  the 
Theory  of  Signal  Detectability  (TSD,  Green  &  Swets,  1966) 
paradigm,  we  can  specify  the  performance  of  an  optimal 
observer  in  different  detection  tasks.  The  central  theme  of 
this  investigation  was  to  identify  whether  selected  display 
coding  factors,  partly  derived  from  knowledge  of  the  optimal 
observer,  would  assist  observers'  detection  decisions. 


The  first  study  examines  observers'  ability  to  combine 
nine  independent,  informational  sources  to  form  a  Yes/No 
detection  decision  (Signal  or  Noise) .  The  information  is 
coded  as  graphical  elements  in  the  visual  field,  and  in  some 
conditions  the  sources  differed  in  their  reliability. 
Frequently,  decisions  are  based  on  multiple  sources  of 
information  that  differ  in  their  inf ormativeness  (or  reli- 
ability) .  An  optimal  observer  includes  this  information  in 
her  detection  decision.  That  is,  she  weights  the  informa- 
tion according  to  inf ormativeness .  However,  when  decisions 
need  to  be  made  rapidly,  observers  do  not  always  consider 
all  relevant  information.  The  observer  may  not  consider  all 
sources  or  she  may  not  apply  an  optimal  weighing  strategy. 
Thus,  the  main  concern  of  this  study  was  to  determine 
whether  selected  factors  assist  observers  in  directing  their 
attention  to  more  reliable  informational  sources. 

The  second  study  further  examines  the  effects  of 
selected  display  formats  on  observers'  detection  decisions. 
Observers  were  given  four  informational  sources  to  perform 
either  a  Yes/No  task,  as  in  the  first  study,  or  a  Four- 
Alternative-Forced-Choice  (4AFC)  detection  task.  Bennett 
and  Flach  (1992)  summarize  the  results  from  a  number  of 
studies  which  suggest  that  factors  related  to  the  display 
element  arrangement  can  differentially  affect  performance  in 
these  two  detection  tasks.  That  is,  selected  display 
arrangements  are  more  likely  to  facilitate  performance  in 
tasks   which   require  integration  of  information,   as   in   a 


Yes/No  task,  than  performance  in  tasks  which  require   focused 
attention,  as  in  a  4AFC  task.   This  study  attempts  to   iden- 
tify  the  importance  of  two  factors  related  to  display   ele- 
ment  arrangement  which  may  be   contributing   to   possible 
differences  in  performance  between  the  two  tasks. 

In  a  Yes/No  decision  task,  an  observer  is  given  a  sample 
of   n  independent  elements  (x, ,  x^,     ...  x^)  to  decide   which 
of   two   alternative   events  (signal  or  noise)   led   to   the 
evidence  observed.   On  a  given  trial,  one  of  the  two   stimu- 
lus  alternatives  is  true,  and  each  element   conveys    inde- 
pendent  information   about  the  current   state.    On   signal 
trials,   each  x.  is  drawn  from  a  normal  distribution  with   a 
mean  of  ^l^  and  a  standard  deviation  of  o^.      On  noise  trials, 
each   Xj  is  drawn  from  a  normal  distribution  with  a  mean   of 
u  and  a  standard  deviation  of  a  .   Alternatively,  in  a  4AFC 
task,  on  each  trial  four  independent  elements  are  presented. 
The  values  of  three  of  the  sources  are  drawn  from  the   noise 
distribution   and  one  source  value  is  drawn  from  the   signal 
distribution.    The  observer  has  to  decide  which   source 
represents  the  "signal"  event. 

Employing  the  TSD  paradigm,  we  can  use  the  information 
about  the  underlying  distribution  parameters  to  identify  how 
an  optimal  observer  should  perform  in  each  of  these  tasks. 
That  is,  we  can  identify  the  optimal  performance  level  of  an 
observer  who  is  only  limited  by  the  uncertainty  of  the 
evidence,  and  who  uses  an  optimal  decision  strategy.  Given 
this   information,  we  can  1)  identify  how  well   an   observer 


performs  relative  to  the  ideal,  and  2)  attempt  to  facilitate 
observer  performance,  which  is  generally  inferior  to  the 
ideal,  by  presenting  the  information  in  a  manner  which  helps 
them  to  act  like  a  mathematically  ideal  observer. 

Defining  the  Optimal  Observer  in  Yes/No  Detection 
The  Theory  of  Signal  Detectability  (TSD,  Green  &  Swets , 
1966;  Green,  1992)  provides  a  quantitative  model  for 
describing  decisions  based  on  uncertain  evidence.  Since  it 
is  a  normative  theory,  it  prescribes  an  optimum  means  of 
combining  the  information  to  form  a  statistic  upon  which  an 
observer  can  base  her  decision.  According  to  TSD,  an  opti- 
mal decision  statistic  is  a  likelihood  ratio,  or  some  value 
that  is  monotonically  related  to  the  likelihood  ratio.  A 
likelihood  ratio  is  the  ratio  of  the  conditional  probabili- 
ties for  the  current  trial  evidence,  x.   That  is, 

L(x)  =  f(x|s)  /  f(x|n) .  (1) 

It   is  assumed  that  the  underlying  distributions  are  normal 
such  that  the  conditional  probabilities  can  be  expressed  as 
f(x|n)  =  [l/(27ra^)^^]EXP[-i5((x  -  M,)/aJM   and 
f(x|s)  =  [l/(27ra^)^]EXP[-!2((x  -  n^)/o^)n,  (2) 

where  a^  =   a^=   o^,    and  to  simplify  the  derivations  ^  <  M  • 
For   n  independent  sources  of  information,  x^ ,  x^,     •••,      x^, 
by  definition  the  probability  of  their  joint  occurrence   is 
the   product   of  their  separate   probabilities,   p(x,*X2)   = 
p(x^)p(X2).    Similarly,  the  likelihood  ratio   for  multiple 
independent   sources  can  be  expressed  as  the  product  of  the 
separate  likelihood  ratios,  L(x^*X2)  =  L(x,)*L(X2). 


since  the  natural  logarithm  of  the  likelihood  ratio  is 
monotonic  with  the  likelihood  ratio,  the  InL(X)  is  also  an 
optimal  decision  statistic.  Thus,  we  have  the  following 
equation: 

Z  =  lnL(x^,X2,  .  .  .  x^)  =  InL(x^)  +  InLCXj) +  •  •  . +lnL(x^)  .  (3) 
When  the  definitions  of  the  conditional  probabilities  for 
the  likelihood  ratios  are  included  in  equation  3  and  this 
equation  is  reduced,  it  turns  out  that  the  optimal  decision 
statistic  is  a  weighted  sum  of  the  evidence  (see  appendix  A 
for  the  derivation) , 

n 

Z  =  Ex.(  (M  .  -  M  O/cr  '  )  -  W  (M  '  -M  •  )/t^  •  )  /        (4) 

1 

where   Xj  is   the  i    source,  drawn  from   either   a   signal 
distribution,   Normal[/i  .,a  .],  or  the   noise   distribution, 
Normal  [/i^j ,  a^j  ]  .    Given  a  large  sample,  Z  is  also   normally 
distributed   since  it  is  the  sum  of  n  mutually   independent 
random  variables,  and  its  mean  and  variance  given  the   two 
alternatives  are 

n  n 

E(Z|s)  =  SU(M3,-Mni)/'^ei)'  '  E(Z|n)  =  -E^:  (  (m.-M.^) /a,,)  M5) 
1  1 

n 

VAR(Z)  =  E((M3-M„)/a^,)^  .  (6) 

1 

The  assumption  is  that  on  a  given  trial  an  ideal  observer 
will  compare  Z  to  some  decision  criterion,  D.  If  Z  is 
greater  than  or  equal  to  D,  then  the  observer  should  respond 
"signal."   Otherwise  she  will  say  "noise." 


When  the  separate  pieces  of  evidence  are  not  equally 
reliable,  an  ideal  observer  is  sensitive  to  these  differ- 
ences and  weighs  the  informational  sources  accordingly.  The 
informativeness,  and  thus  the  appropriate  weight,  for  a 
given  source  can  be  represented  by  the  d'  statistic  as 
follows: 

Expected  optimal  performance  in  a  detection  task  is  limited 
by  the  informativeness  of  the  underlying  evidence.  That  is, 
an  observer's  performance  given  a  single  source  will  not 
exceed  d'^.  Based  on  equation  7,  the  informativeness  of  a 
particular  source  can  be  manipulated  by  either  changing  the 
distance  between  the  two  distribution  means,  u  .  -  u  .  = 
Sfj..,    or  the  size  of  the  standard  deviation,  a   .. 

In   the   first  study,  Sijl.   is  held  constant  (Sii^    =   5^2 
=  ...(5/i^),  and  the  informativeness  of  the  separate  sources  is 
controlled   by  changing  a^..      Smaller  values  of  a   .  produce 
larger  d'^  values   and  thus   represent  more   informative 
sources.    Table  1  lists  the  distribution  parameters   corre- 
sponding with   a  condition  in  which  the  even   sources,   x^ , 
have  lower  variability,  making  them  relatively  more  informa- 
tive. 

The  optimal  weight  for  a  given  source,  a.,  is  related  to 
the  d'  value.   When  fi^.   is  held  constant,  a^  is  proportional 
to  the  reciprocal  of  the  variance  for  that  source. 


n 

a,.  =  1  /  [a^^S(l/a^;  )].  (8) 

1 


Table  1. 
The   mean   and   standard   deviation   of   five   informational 
sources   in  which  the  sources  alternate  in  reliability;   the 
even  elements  have  the  highest  reliability. 


element 

1 

2 

3 

4 

5 

siqnal 

Ms 

1 

1 

1 

1 

1 

^s 

1.5 

0.75 

1.5 

0.75 

1.5 

noise 

^n 

0 

0 

0 

0 

0 

^n 

1.5 

0.75 

1.5 

0.75 

1.5 

d' 

0.67 

1.33 

0.67 

1.33 

0.67 
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When  an  observer's  decision  is  based  on  multiple  sources, 
optimal  performance  is  expressed  in  terms  of  the  following 
d'  statistic: 

d'^=  ZiSn^/a^])    /    (E(<5M^/a^p)^^  =  [Sd'^]^  (9) 

1  1  1 

Equation   9  can,  then,  be  rewritten  to  include   the   optimal 

weights  as  follows: 


n  n 

ei 


d'ideai  =  [^/^i  ^^,]  /  [(2a|a^p  n-  (10) 

1       1 


If  the  weights  are  normalized  and  the  optimal  weighting 
pattern  requires  equal  weight  across  elements,  then  equation 
10  equals  the  product  of  the  square-root  of  n  and  d '  ^^ . 

The  preceding  equations  allow  us  to  define  the  perform- 
ance of  an  ideal  observer  who  is  only  limited  by  the  uncer- 
tainty of  the  evidence.  This  provides  a  standard  by  which 
we  can  compare  an  observer's  performance  which  is  fre- 
quently inferior  to  the  ideal.  For  a  multiple  observation 
task,  some  of  the  loss  in  performance  may  be  a  product  of 
observers  using  a  nonoptimal  weighting  strategy.  Other  loss 
may  be  more  generalized  (e.g.,  some  form  of  internal  noise), 
showing  up  as  an  overall  performance  loss.  To  discriminate 
the  effects  of  these  two  sources  of  error,  we  need  to  iden- 
tify how  the  observer  weights  the  separate  sources.  A 
technique  designed  by  Berg  (1989,  1990)  provides  a  means  for 
estimating  the  observer's  relative  weights. 

Generally  speaking,  the  observers'  weights  are  related 
to   the  slopes  of  empirical  cumulative  normal   distributions 


which  Berg  refers  to  as  Conditional-On-A-Single-Stimulus,  or 
COSS  functions.  A  COSS  function  is  a  plot  of  the  proportion 
of  times  an  observer  responded  "signal"  as  a  function  of  the 
magnitude  of  a  given  element  across  experimental  trials. 
Two  COSS  functions  are  calculated  for  each  element,  one  for 
signal  trials  and  one  for  noise  trials. 

Figure  1  shows  the  COSS  functions  derived  from  simulated 
data  of  an  observer  using  three  informational  sources  to 
perform  a  Yes/No  detection  decision.  The  COSS  functions  on 
the  left  represent  an  observer  using  an  equal  weighting 
strategy.  The  COSS  functions  on  the  right  represent  an 
observer  who  weights  the  first  source  most  and  the  third 
source  the  least.  The  upper  curves  with  the  square  symbols 
in  each  graph  of  figure  1  represent  the  COSS  functions  for 
the  signal  trials.  The  lower  curves  with  the  circles  in 
each  graph  represent  the  COSS  functions  for  the  noise 
trials.  Figure  2  depicts  the  weights  of  the  three  sources 
derived  from  the  COSS  functions  shown  in  Figure  1.  The 
small  squares  and  the  circles  represent  the  weight  estimates 
for  the  signal  and  noise  trials,  respectively.  The  lines 
connecting  the  points  represent  the  average  of  the  two 
weight  estimates.  The  solid  and  dashed  lines  represent  the 
equal  and  unequal  weighting  strategies,  respectively.  By 
comparing  figures  1  and  2 ,  it  can  be  seen  that  when  the  COSS 
functions  have  similar  slopes  the  weights  are  relatively 
equal.  Alternatively,  looking  at  the  graphs  on  the  right 
side   of   figure   1,  we  can  see   that   the   smaller  weights 
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Figure  1.   The  COSS  functions  derived  from  a  simulated 

observer  using  an  equal,  left  panels,  or  unequal,  right 
panels,  weighting  strategy.   The  top  functions  with  the 
squares  and  the  the  bottom  functions  with  the  circles 
represent  the  signal  and  noise  trials,  respectively. 
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Figure  2 .   The  weights  derived  from  the  COSS  functions 
depiected  in  Figure  1.   The  squares  and  circles 
represent  the  weight  estimates  for  the  signal  and  noise 
trials,  respectively.   The  solid  and  the  dashed  lines 
represent  the  average  of  signal  and  noise  weights  for 
the  equal  and  unequal  weighting  strategies,  respect- 
ively. 
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correspond  with  the  shallower  slopes  in  the  COSS  functions. 

The  actual  weights  depicted  in  Figure  2  are  based  on  the 
variance  of  a  cumulative  normal  distribution  which  had  the 
best  Chi-square  fit,  VAR[Y.],  to  the  observer's  COSS  func- 
tions, represented  by  the  solid  lines  in  figure  1.  (A 
detailed  description  of  Berg's  (1989)  theoretical  solution 
for  the  relative  weights  is  found  in  appendix  B) .  This 
estimated  variance  is  added  to  the  variance  of  the  distribu- 
tion from  which  the  items  were  sampled,  a  ?  .  Then,  to 
derive  the  relative  weights,  the  sum  of  the  variances  for 
each  source  is  divided  by  the  sum  of  the  variances  corre- 
sponding with  one  source  set  to  unity, 

n 

VARFY.]  +  a  ^     Sa?  a  ?  /  a^     a} 

'•  J  -I      ej         1   ei   '    J        k 
1 

-   -  (11) 

n 

VAR[Y.]  +   al  Sa?  a  ?  /  a^     a?. 

*•  k-'      ek         1   ei   '    k        j 
1 

Finally,  the  weights  are  normalized  such  that  Ea.  =  1. 

Note  that  the  choice  of  which  source  is  to  be  set  to 
unity  is  arbitrary.  That  is,  the  investigator  should  decide 
which  item  is  the  best  choice  given  the  hypothesis  that  is 
being  addressed.  For  instance,  Berg  and  Green  (1990)  used 
the  COSS  technique  in  an  auditory  profile  analysis  task.  A 
profile  task  involves  detecting  an  increment  in  the  level  of 
a  single  component  (tone)  among  a  multi-component  back- 
ground. Given  an  optimal  decision  strategy  that  compares 
the  mean  level  of  the  signal  component  to  the  mean  level  of 
the   nonsignal  components,  the  greater  the   difference   from 
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zero  the  more  likely  an  increment  was  added.  If  the  weight 
assigned  to  the  signal  component  is  set  to  unity,  then  the 
optimal  weighting  for  the  nonsignal  components  should  equal 
-l/(n-l)  (where  there  are  n  components).  For  this  experi- 
ment the  element  found  in  the  center  of  the  visual  field  at 
the  fixation  point,  x^,  was  set  to  unity. 

Given   the   observers'  estimated  weights.   Berg   (1990) 
shows   how  these  weights  can  be  incorporated  into  a   measure 
of   the  observers*  weighting  performance.   This   measure   is 
the  same  as  equation  10,  except  the  observer's  weights,   a^, 
are  used  instead  of  the  ideal  weights,  a., 

d'^g^  =  [6n.  Ha.]    /    [(Sa^a^p^^]  (12) 

1         1 

If   the  observer  applies  a  nonoptimal  weighting  pattern   the 

observer's  weighting  performance,  d'y-^,  will  be  lower  than 

that  of  an  ideal  observer,  d'.,  ,. 

'     ideal 

Furthermore,   we  can  obtain  a  measure  of  the   observer's 
overall   performance,  d'^^^^,  on  the  task  by   calculating   the 
absolute  value  of  the  difference  between  the  Z-scores  corre- 
sponding  with  her  hit  and  false  alarm  probabilities  on   the 
experimental  trials.   If  this  measure,  d'^^^^,  is  lower  than 
d'   ,   then  the   additional  loss   in  performance   can  be 
thought  of  as  the  effects  of  the  observer's  internal   noise, 
a^^^.   That  is,  unlike  an  ideal  decision  maker,  an   observer 
will  often  be  less  reliable  at  transferring  information  from 
the   environment  into  a  decision  statistic.    It  is   assumed 
that  internal  noise  is  independent  of  the  weight  estimates. 
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Finally,   once  the  three  performance  measures,   d '  ..  ,, 

-"  '^  '  ideal  ' 

d'   ,   and   d'^^^,  have  been  derived,   performance   can   be 
summarized  in  terms  of  an  efficiency  measure  (Tanner  & 
Birdsall,   1958).     Berg  (1990)  describes   observers'   per- 
formance  in  terms  of  three  efficiency  measures:  one   repre- 
senting  the  observers  overall  performance,   another   repre- 
senting  the  observers'  weighting  performance,  and   a   third 
representing   residual   factors  such  as  internal   noise.    A 
general  measure  of  the  observer's  overall  efficiency,  rj  .    , 
is  the  squared  ratio  of  her  performance,  d'^^^^,  relative   to 
the  performance  of  an  ideal  observer,  d'-^^^^.   That  is, 

^obs=  (d',,3/  d'.,^3^)^.  (13) 

If  the  observer  is  optimal,  ^^^s  ~  l-O-   Any  decrement  in  the 
observer's  performance  will  correspond  with  a  decrease  in 
efficiency,  where  0  <  Hq^s  "^  ^' 

The  other  two  efficiency  measures,  rj    ,  and  n  .  ,  allow 

-"  '   'wgt       'noise' 

US  to  separate  the  loss  in  observer  efficiency  due  to  non- 
optimal  weighting  from  loss  due  to  observer  internal  noise, 
respectively.   The  weighting  efficiency,  like  the  overall 
efficiency,  n^bs'  ^^  ^^^  measure  of  the  observer's  weighting 
performance,  d'  ^,  relative  to  the  ideal  observer,  d'.,  ,, 

*■  '     wgt  '  '     Ideal  ' 

who  uses  an  optimal  weighting  strategy: 

%,.=    (d'.g./  d',,,3,)^.  (14) 

'^noise  sccounts  for  any  additional  loss  in  d'^^^^  not  explained 
by  the  weights,. 

'7noise=  (^'o,s/     ^\,^)  '   '  d^) 
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The  relationship  among  these  measures  is 

Defining  the  Optimal  Observer  in  4AFC  Detection 
In   a   Four-Alternative-Forced-Choice   (4AFC)   task,   an 
observer   is  given  four  independent  sources  of   information, 
where   each   source   represents   one   of   two   alternatives, 
"signal"   or   "noise."   On  a  given  trial,  one   of   the   four 
elements  is  randomly  selected  to  represent  the  signal  event. 
This  source  value  is  drawn  from  a  normal  distribution  with  a 
mean   of  fx^  and  a  standard  deviation  of  a  .         The   remaining 
three   source   values  are  drawn  from  a   normal   distribution 
with  a  mean  of  /x  and  a  standard  deviation  of  a  ,    where  u  < 
u.     and  a    =   a    =   a^.      The  observer's  task  is   to   identify 

"^s  n     s     e  ■'■ 

which  of  the  four  sources  represents  the  signal.  Rather 
than  combining  the  information  to  make  a  single  response,  as 
with  the  Yes/No  task,  a  4AFC  task  requires  the  observer  to 
independently  assess  each  value  to  identify  which  source 
represents  the  signal. 

In  the  second  study,  there  are  four  informational 
sources,  represented  by  graphical  elements  located  in  four 
separate  spatial  position  in  the  visual  field.  For  the  4AFC 
task  there  are  four  possible  stimulus  sequences,  <s,n,n,n>, 
<n,s,n,n>,  <n,n,s,n>,  or  <n,n,n,s>,  where  <s,n,n,n>  repre- 
sents a  stimulus  value  in  the  first  spatial  position  and 
three  noise  values  in  the  second,  third,  and  fourth  spatial 
positions.  The  observer  has  to  determine  which  of  the  four 
spatial  orders  in  present  on  a  given  trial.   Thus,  there  are 
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four  possible  responses,  <S,N,N,N>,  <N,S,N,N>,  <N,N,S,N>,  or 
<N,N,N,S>,  corresponding  with  the  four  equally  likely  loca- 
tions where  the  signal  can  occur. 

Table   2   depicts  the  stimulus-response  matrix   for   the 
decision   task.   The  matrix  cells  falling   along   the   minor 
diagonal   represent   correct  responses.   T-  represents   the 
total  correct  responses  for  the  i^*^  ordering  of  the  stimuli, 
Sg. .   The  percentage  correct  in  a  4AFC  task,  P^(C),  is 

p,(c)  =  It./   n,^^3^,  (17) 

1 

where  N^^^^^  is  the  total  number  of  trials  across  all   stimu- 
lus orders. 

Green  (1992)  shows  that  an  ideal  observer,  who   attempts 
to  maximize  percent  correct,  will  choose  the  source  with  the 
largest  value  since  this  value  also  has  the  largest   likeli- 
hood ratio.   To  expedite  the  derivation.  Green  characterizes 
the  task  as  detection  of  1-of-m  possible  signals  relative  to 
noise   alone.   Using  this  approach,  each  sequence  would  be 
represented  as   a  separate  signal,  e.g.,  <s,n,n,n>   =  Sg^ . 
Thus,   the  likelihood  that  the  evidence,  x  =   <x^ ,  X2,  x^,  x^>, 
presented  on  a  given  trial  represents  the  i^^  signal   com- 
pared to  noise  alone  may  be  expressed  as  follows: 

l(x|Sg.)  =  EXP[x.((M3-Mj/a^^  )  -  ^.  (  (M^  "M^  )/cr^  )  ]  ,     (18) 
where   |i^  <  11^,    and  a^=   o^  =   o^.  x.  is   monotonically 

related  to  the  optimal  decision  statistic,  l(x|Sg.).  Thus, 
the  observer  should  choose  the  largest  value,  x.,  since  this 
value  also  has  the  largest  likelihood  ratio.   Green's  (1992) 
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Table  2. 
The   stimulus-Response   matrix  for  the  4 AFC  task.    The 
sequence  <s,n,n,n>  represents  a  stimulus  value  in  the   first 
spatial   position   and   three  noise  values   in   the   second, 
third,  and  fourth  spatial  postitions. 


Sgl 
<s,n, n, n> 

<n,: 

Sg2 
3,n,n> 

<n, 

Sg3 
n, s, n> 

Sg4 
<n, n,n, s> 

<S,N,N,N>  Tl 

<N,S,N,N> 

T2 

<N,N,S,N> 

T3 

<N,N,N,S> 

T4 
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derivations  of  the  optimal  decision  statistic  for  an  Four- 
Alternative-Forced-Choice  task  are  found  in  appendix  C.   An 
alternative  calculation  of  the  optimal  decision  statistic 
that  considers  the  sequences  as  four  separate  hypotheses 
yields  a  slightly  different  decision  statistic,  but  the  same 
decision  strategy.   That  is,  an  optimal  observer  should 
choose  the  source  with  the  largest  value  since  it  also  has 
the  largest  likelihood  ratio. 

The  accuracy  of  an  observer  using  this  predicted  optimal 
decision  strategy  depends  upon  the  probability  that  the 
largest  value  was  actually  sampled  from  the  signal  distribu- 
tion. That  is,  P^(C)  depends  on  the  probability  that  the 
sample  from  the  signal  distribution,  f(x|s),  is  greater  than 
the  samples  from  the  noise  distribution,  f(x|n)  .  Consider- 
ing two  alternatives,  the  probability  that  one  random  varia- 
ble is  larger  than  another  can  be  expressed  as  follows: 

u 


PjCC)  = 


00 

f(u|s) 

-00 


f(v|n)  dvdu.  (19) 


-00 


Equation  19  represents  the  probability  that  the  value  of  the 
noise  sample,  v,  is  less  than  the  value  of  the  signal  sam- 
ple, u,  summed  across  all  possible  values  of  u  (Green, 
1992).  Since  the  same  probability  density  functions,  f(x|s) 
and  f(x|n),  are  used  to  define  the  hit  and  false  alarm 
probabilities  found  in  a  Yes/No  ROC  curve,  it  is  possible  to 
relate  performance  in  an  m-Alternative-Forced-Choice  task  to 
performance  in  a  Yes/No  task.  That  is,  equation  19  can  be 
rewritten  to  produce  the  following  equation. 


P2(C)  = 


19 

1 

[1-P^  (S|n)]dPJs|s),  (20) 

0 


where  1-P  (s|n)  is  the  complement  of  the  false  alarm  proba- 
bility, and  -dp  (S|s)  is  the  derivative  of  the  complement  of 
the  hit  probability.  (See  Appendix  D  for  the  derivations.) 
Equation  2  0  shows  the  area  under  a  Yes/No  ROC  curve  is 
related  to  percent  correct  in  a  2AFC  task  (Green,  1992; 
Green  &  Swets ,  1966).  Finally,  equation  20  can  be  rewritten 
to  account  for  multiple  alternatives  as  follows, 

'1 


Pm(C)   = 


im-1 


[1-P^  (S|n)  ]""■'   dPJSis)  ,         (21) 
0 


where  m  >  2 . 

Thus,  it  is  possible  to  convert  a  percent  correct  value 
in  an  m-Alternative-Forced-Choice  task  to  a  Yes/No  d'  meas- 
ure. Hacker  and  Radcliff  (1979)  published  tables  which 
allow  us  to  make  conversions  from  percent  correct  in  an  m- 
Alternative-Forced-Choice  task  to  a  Yes/No  d'.  This  table 
takes  into  account  the  uncertainty  associated  with  larqer 
numbers  of  alternatives.  For  instance,  when  P2(C)  =0.8  in 
a  2 AFC  task  d'  =  1.19;  however,  in  a  4 AFC  task  the  same 
percent  correct,  P^(C)  =  0.8,  yields  a  d'  =  1.89.  Finally, 
given  the  relationship  between  performance  in  Yes/No  and 
mAFC  tasks,  maximum  percent  correct  in  a  4AFC  task  will  also 
be  limited  by  the  underlying  distribution  parameters  (Mac- 
Millian  &  Creelman,  1991) .  That  is,  when  percent  correct  in 
a  4AFC  task  is  converted  to  a  Yes/No  d',  performance  will 
not  exceed  d'  as  defined  in  equation  7. 
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The  preceding  definitions  of  the  optimal  observer  in 
both  a  Yes/No  and  4 AFC  detection  decision,  provide  a  base- 
line for  comparing  observer  performance  under  different 
experimental  conditions.  In  situations  where  observer 
performance  falls  short  of  the  ideal,  performance  may  be 
facilitated  by  presenting  the  information  in  some  manner 
which  helps  them  to  act  more  like  an  ideal  observer.  The 
two  studies,  to  be  described,  address  this  approach  to 
optimizing  human  performance.  That  is,  these  studies  look 
at  the  effects  of  selected  display  coding  factors  which  were 
designed  to  help  observers  function  as  optimal  observers  on 
their  detection  decisions. 


EXPERIMENT  1 
Introduction 

Visual  displays  are  commonly  used  to  convey  system 
information,  such  as  air  traffic  flow  or  the  status  of  a 
production  line,  to  a  human  decision  maker.  A  complex 
visual  display  may  include  several  subordinate  displays  or 
display  "elements."  Each  display  element  provides  a  poten- 
tial source  of  information  for  the  human  operator.  However, 
it  may  be  impossible  for  the  operator  to  obtain  useful 
information  from  more  than  a  few  of  the  display  elements  at 
one  time.  This  problem  may  be  minimized  if  the  operator  can 
prioritize  the  display  elements  in  terms  of  their 
criticality  and  inf ormativeness,  and  if  the  operator  can 
allocate  his  or  her  attention  accordingly.  This  study 
examined  several  factors  that  affect  an  operator's  ability 
to  allocate  attention  to  display  elements  that  are  differen- 
tially informative. 

In  a  previous  experiment  (Sorkin,  Mabry,  Weldon,  & 
Elvers,  1991) ,  observers  examined  a  multi-element  display 
and  then  reported  whether  the  display  represented  the  occur- 
rence of  a  signal  or  nonsignal  event.  Using  a  technique 
derived  from  the  Theory  of  Signal  Detectability  (TSD,  Green 
&  Swets,  1964),  Sorkin  et  al.  estimated  the  importance  or 
weight  the  observer  assigned  to  each  element  of  the   display 
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in  making  a  detection  decision  (Berg,  1989,  1990).  An 
optimal  decision-theoretic  observer  weights  the  input  from 
each  element  according  to  the  element's  informativeness  or 
reliability;  highly  reliable  display  elements  are  weighted 
more  highly  in  the  detection  decision  than  less  reliable 
elements  (Durlach,  Braida,  &  Ito,  1986;  Berg,  1990;  Berg  & 
Green,  1990) . 

In  the  Sorkin  et  al,  (1991)  study  all  display  elements 
were  equally  informative;  hence,  each  element  should  have 
been  weighed  equally  in  the  observers'  decisions.  When  the 
observation  durations  were  long,  the  weights  were  equal 
across  the  spatial  array  of  display  elements.  However,  when 
the  observation  durations  were  brief  and  the  display  coding 
was  complex,  the  highest  decision  weights  were  associated 
with  display  elements  in  the  center  of  the  visual  field, 
around  the  observer's  fixation  point.  The  extent  to  which 
the  weighting  functions  were  peaked  corresponded  with  the 
performance  level  (low  performance  was  associated  with 
peaked  functions).  Sorkin  et  al.  (1991)  concluded  from 
these  results,  that  under  difficult  conditions,  the  observ- 
er's allocation  of  attention  was  restricted  to  the  central 
portion  of  the  display. 

This  interaction  between  the  difficulty  of  the  task  and 
the  availability  of  information  from  different  regions  of 
the  display  is  not  surprising.  A  number  of  variables  are 
known  to  affect  an  observer's  ability  to  obtain  information 
from   the  elements  of  a  complex  visual  display.    These 
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include  the  number  (Perrott  et  al.;  1991)  and  spacing  (Andre 
&  Wickens,  1988)  of  irrelevant,  or  distracter,  items  found 
in  the  visual  field,  the  type  of  display  code  (Boles  & 
Wickens,  1987;  Legge,  Gu,  &  Luebker,  1989;  Sanderson,  Flach, 
Buttigieg,  &  Casey,  1989;  Sorkin  et.  al.,  1991),  display 
item  intensity  (Eriksen  &  Rohrbaugh,  1970) ,  and  task  com- 
plexity (Williams,  1982). 

When  the  stimulus  durations  in  the  Sorkin  et  al .  (1991) 
experiment  were  long  (more  than  500  ms) ,  all  display  element 
weights  were  equal,  indicating  that  the  observers  could 
process  information  from  all  regions  of  the  display.  Since 
the  reliability  of  all  the  elements  was  also  equal,  an  equal 
weight  strategy  was  optimal  for  that  task.  An  important 
question  is  whether  an  observer  can  employ  optimum  weights 
when  the  reliabilities  of  the  elements  are  not  equal  across 
the  visual  array.  Obviously,  the  ability  to  match  decision 
weights  to  the  element  reliability  is  necessary  if  the 
observer  is  to  prioritize  the  display  elements  according  to 
their   importance  to  the  task. 

When  an  informational  source  does  not  provide  a  consist- 
ent report  of  an  unchanging  event,  the  source  is  not  reli- 
able. For  instance,  if  a  sensor  measures  a  specific  lumi- 
nance value  to  be  x  at  one  time  and  x  ±  n  on  a  subsequent 
reading,  the  sensor  is  showing  variability  in  its  measure- 
ment. Thus,  this  sensor  would  be  less  reliable  than  one 
which  produces  a  consistent  measure  across  time.  A  person 
forming   a  decision  based  on  this  information   should   place 
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greater  weight  on  the  more  reliable  source.  However,  evi- 
dence suggests  that  people  tend  to  overrate  the  importance 
of  unreliable  sources  (Schum,  1975),  Wickens  (1984)  states 
that  when  people  are  confronted  with  sources  which  are  not 
equally  informative,  they  perform  the  task  "as  if"  all 
sources  were  equally  reliable. 

The  present  study  addressed  whether  observers  can  use 
differences  in  display  element  variability  to  identify 
source  reliability  and  use  this  information  in  forming  a 
simple  Yes/No  detection  decision.  In  addition,  this  study 
was  designed  to  determine  whether  using  this  information 
imposes  a  significant  amount  of  additional  processing 
"overhead"  on  the  observer,  and  whether  selected  display 
factors  could  reduce  related  performance  loss.  As  in  Sorkin 
et  al.  (1991),  the  observers  in  the  current  study  performed 
a  multi-channel  visual  detection  task.  On  each  trial  of  the 
experiment,  observers  were  presented  with  a  display  consist- 
ing of  nine  display  elements.  The  display  elements  were 
nine  vertical  line-graph  gauges  arranged  in  a  horizontal 
array  (see  figure  3) .  The  values  displayed  on  the  line- 
graph  gauges,  <x^ ,  x^,  .  .  .  ,  x^> ,  were  determined  by  inde- 
pendent, normally  distributed,  random  variables.  On  a 
signal  trial,  the  values  of  the  nine  elements  were  selected 
from  a  distribution  with  a  mean  of  jj,^  and  a  standard  devia- 
tion of  o^.  On  a  noise  trial,  the  values  were  drawn  from  a 
distribution  with  a  mean  of  u  and  a  standard  deviation  of 
a^,      where   jU^  <  ii^.      The  observer's   task  was   to   decide 
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Figure  3.  Demonstration  of  the  nine  graphical  elements  found 
in  experiment  1. 
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whether  the  data  displayed  had  been  generated  from  the 
signal  or  noise  distribution. 

The  reliability  of  different  display  elements  was  con- 
trolled by  manipulating  the  variance  of  the  distributions 
from  which  the  element  values  were  sampled:  high  reliability 
elements  were  sampled  from  distributions  with  lower  variance 
than  low  reliability  elements.  That  is,  a  source  high  in 
reliability  would  be  analogous  to  an  instrument  which  shows 
consistent  measurements  across  time  whereas  a  source  low  in 
reliability  would  not  provide  consistent  measurements.  The 
variance  of  a  display  element  at  a  particular  position  was 
the  same  for  signal  and  noise  trials,  but  differed  across 
elements  depending  on  the  experimental  condition.  Table  1 
illustrates  the  mean  and  standard  deviations  that  could  be 
employed  for  a  five  element  display  in  which  odd  and  even 
elements  alternated  in  their  level  of  reliability. 

Berg  (1990)  found  that  the  reliability  of  elements  in  an 
auditory  task  similar  to  the  one  used  in  this  study  could  be 
used  by  observers  when  the  most  reliable  tones  were  much 
louder  than  the  less  reliable  tones.  The  loudness  cue  was 
much  less  effective  when  reversed,  in  which  case  a  louder 
cue  indicated  a  lower  reliability.  Berg's  results  suggest 
that  under  some  conditions  cuing  element  reliability  (e.g., 
with  intensity  or  color)  may  aid  observers  in  accurately 
weighting  display  sources  by  their  importance. 

Cues  such  as  size,  intensity,  color,  and  movement  are 
often   incorporated  in  display  design  to  draw   attention   to 
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specific  items  in  a  display.  For  instance,  researchers  have 
found  that  correct  utilization  of  color  coding  (Christ, 
1990;  Fisher  &  Tan,  1989)  can  reduce  search  time  in  locating 
an  item  in  a  display.  Furthermore,  Wickens  and  Andre  (1990) 
showed  that  color  coding  a  particular  item  in  an  object 
display  leads  to  improved  accuracy  in  recalling  the  specific 
value  associated  with  that  item  relative  to  a  monochromatic 
display.  Thus,  given  Berg's  results  and  the  evidence  cited 
above,  we  predicted  that  observer  weighting  efficiency  in 
the  present  experiment  should  be  higher  for  a  condition  in 
which  a  luminance  cue  signals  the  element  reliability. 

In  order  to  test  the  efficacy  of  a  cue  for  element 
reliability  in  the  present  experiment,  the  spatial  position 
of  the  high  reliability  display  elements  was  randomly  varied 
over  trials.  The  overall  luminance  of  the  display  element 
varied  in  accordance  with  the  reliability  (high  or  low)  of 
the  element.  We  expected  that  luminance  would  provide  a 
natural  code  for  allocating  observer  attention  and  hence 
weight,  to  the  high  reliability  elements.  If  that  were  the 
case,  the  efficiency  of  the  observers'  weighting  strategy 
would  be  much  higher  in  a  cued  than  in  an  uncued  condition. 

The  duration  of  the  stimulus  and  the  spatial  arrangement 
of  the  element  reliabilities  also  should  influence  how 
efficiently  the  observers  match  their  weights  to  the  element 
reliabilities.  The  results  from  the  Sorkin  et  al.  (1991) 
study  suggested  that  233-ms  was  sufficient  time  for  observ- 
ers  to  utilize  information  from  as  many  as   nine,   equally 
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reliable,  graphically  coded  display  elements.  However,  it  is 
possible  that  sensing  the  element  reliabilities  and  differ- 
entially weighting  the  elements,  may  require  some  additional 
processing  steps  or  "overhead"  by  an  observer.  A  duration 
of  233-ms  may  be  at  the  margin  of  an  observer's  ability  to 
extract  the  information  needed  to  discriminate  and  employ 
differences  in  element  reliability.  For  example,  a  slower, 
serial  search  may  be  required  to  extract  the  reliability 
information  and  weight  the  elements  accordingly.  In  that 
case,  it  might  be  advantageous,  when  processing  short  dura- 
tion stimuli,  to  ignore  reliability  and  differential  weight- 
ing information.  Our  experiments  tested  three  levels  of 
stimulus  duration  (150,  400  and  800-ms) .  We  expected  that 
weighting  efficiency  would  be  greatest  at  long  stimulus 
durations  (400-ms  and  800-ms)  and  very  poor  at  the  shortest 
duration  (150-ms). 

Observer  sensitivity  to  element  reliability  also  may  be 
affected  by  the  spatial  arrangement  of  element  reliability. 
If  attention  is  distributed  more  effectively  among  spatially 
contiguous  than  separated  items,  grouping  sources  similar  in 
reliability  should  aid  performance.  Posner,  Snyder,  and 
Davidson  (1980)  found  that  simple  reaction  time  to  detect  a 
light  at  a  second  most  likely  position  was  facilitated  only 
when  this  item  was  adjacent  to  a  cued  location  (the  most 
likely  target  location) .  When  the  second  most  likely  posi- 
tion was  separated  by  more  than  one  location,  detection 
speed  was  not  facilitated.   Thus,  the  weighting   efficiency 
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should  be  better  for  displays  with  elements  grouped  by 
similar  reliabilities  than  for  displays  that  distribute 
element  reliabilities  across  the  array. 

Finally,  we  were  interested  in  whether  observers  would 
be  sensitive  to  the  reliability  of  individual  elements 
without  any  cues  to  element  reliability.  That  is,  can 
observers  estimate  (and  employ)  information  about  element 
reliability  based  only  on  the  trial-by-trial  variability  of 
the  readings  from  individual  display  elements  and  feedback 
about  the  S/N  events?  To  answer  that  question,  we  added 
conditions  in  which  the  relationship  between  element  spatial 
position  and  reliability  was  fixed,  rather  than  random, 
over  a  block  of  200  trials.  If  the  observer  can  estimate 
the  variance  of  the  element  readings  from  the  first  k  trials 
of  a  block,  the  observer  may  be  able  to  partition  the  ele- 
ments into  those  with  high  and  low  reliabilities.  If  that 
process  led  to  the  assignment  of  higher  weights  to  the  more 
reliable  elements,  the  observer's  weighting  efficiency  would 
be  enhanced  in  that  condition. 

Method 
Subjects 

Four  University  of  Florida  students  with  normal,  or 
corrected  to  normal,  visual  acuity  participated  in  this 
study.  One  subject,  S2,  was  later  discovered  to  be  color 
deficient,  and  another,  S4,  was  highly  trained  on  the  task. 
They  were  paid  an  hourly  wage  plus  a  bonus  based  on  perform- 
ance. 
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Apparatus  and  Stimuli 

Observers  were  seated  in  a  sound  isolated  booth  approxi- 
mately 27  inches  away  from  a  10.5  inch  color  monitor  (EGA) 
driven  by  an  80386  computer.  The  monitor  was  set  for  maxi- 
mum contrast,  and  intensity  was  set  at  approximately  102 
cd/m^  ,  measured  from  a  7.5  inch  by  4  inch  uniform  white 
field.  On  a  given  trial,  nine  gauges  were  presented  on  the 
monitor;  subtending  a  horizontal  by  vertical  visual  angle  of 
approximately  15°  by  8 ° .  Each  gauge  was  composed  of  two 
parallel  white  lines,  with  tick-marks  falling  at  equal 
intervals  on  the  left  line  for  all  conditions  except  the 
luminance  cue  condition.  For  this  latter  condition  high 
reliability  gauges  were  white  and  the  remaining  gauges  were 
gray.  The  intensity  of  the  white  gauges  was  approximately 
102  cd/m^  and  the  intensity  of  the  gray  gauges  was  approxi- 
mately 22  cd/m^  measured  from  7.5  inch  by  4  inch  uniform 
white  and  gray  fields,  respectively. 

Each  tick-mark  represented  a  display  increment  of  1.0, 
and  ranged  from  0.0  to  10.0.  Two  longer  blue  lines,  located 
near  the  tick-marks,  indicated  the  positions  of  the  signal 
and  noise  distribution  means.  The  value  displayed  by  each 
gauge  was  determined  by  sampling  a  number  from  either  a 
"signal"  or  "noise"  distribution,  depending  on  the  type  of 
trial.  This  number  was  converted  to  the  vertical  displace- 
ment of  a  horizontal  white  line  from  the  bottom  (e.g.  zero 
position)  of  the  gauge  (see  figure  3) .  The  gauge  values 
were  drawn  from  the  signal  distribution  on  50  percent  of  the 
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trials.  The  mean  of  the  gauge  values  on  signal  trials,  /x^, 
was  equal  to  5.0;  the  mean  on  noise  trials,  ^r,/  ^^s  equal  to 
4.0. 

The  standard  deviation  of  the  gauge  values  on  signal  and 
noise  trials  depended  on  the  particular  experimental  condi- 
tion. There  were  five  different  element  reliability  condi- 
tions: (1)  Equal,  (2)  Grouped-Left-High,  (3)  Grouped-Right- 
High,  (4)  Distributed-Even-High,  and  (5)  Distributed-Odd- 
High.  In  the  Equal  condition,  the  standard  deviation  of  all 
gauge  elements  was  equal  to  1.  In  the  Grouped-Left-High 
condition,  the  standard  deviation  of  the  four  left  elements 
was  equal  to  0.75,  and  the  five  right  elements  was  equal  to 
1.5.  That  pattern  was  reversed  in  the  Grouped-Right-High 
condition.  In  the  Distributed-Even-High  condition,  the 
standard  deviation  of  the  four  even  elements  (element  2,  4, 
6,  and  8)  was  equal  to  0.75,  and  the  standard  deviation  of 
the  remaining  elements  was  equal  to  1 . 5 .  In  the  Distribut- 
ed-Odd-High  condition,  the  standard  deviation  of  the  five 
odd  elements  (element  1,  3,  5,  7,  and  9)  was  equal  to  0.85, 
and  the  standard  deviation  of  the  remaining  elements  was 
equal  to  1.3. 

The  unequal  reliability  conditions  were  run  under  two 
different  trial  block  conditions:  Pure  Block  and  Mixed 
Block.  In  the  Pure  Block  condition,  all  display  and  distri- 
bution parameters  were  fixed  within  a  block  of  200  trials. 
Thus,  in  four  conditions  (Grouped-Left-High,  Grouped-Right- 
High,  Distributed-Even-High,  and  Distributed-Odd-High) ,   the 
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relationship  between  element  reliability  and  spatial  posi- 
tion was  fixed  throughout  the  block  of  trials.  In  the  Mixed 
Block  conditions,  the  trials  within  a  block  of  200  trials 
alternated  randomly  among  the  Grouped-Left-High,  Grouped- 
Right-High,  Distributed-Even-High,  and  Distributed-Odd-High 
conditions.  In  the  Mixed  Block  conditions,  it  would  be 
impossible  for  an  observer  to  identify  the  reliability  of 
any  given  spatial  element,  unless  the  observers  were  provid- 
ed with  an  additional  trial-by-trial  cue  to  element  reli- 
ability. Finally,  all  trial  block  conditions  were  tested  at 
three  levels  of  stimulus  duration  (150,  400,  and  800  ms) . 

The  duration  of  the  stimulus  presentation  was  synchro- 
nized with  the  refresh  traces  of  the  monitor.  The  period 
between  traces  was  approximately  17  ms.  The  onset  and 
offset  of  the  display  was  delayed  until  a  retrace  was  ready 
to  occur.  Once  the  stimulus  was  presented  the  duration  was 
controlled  by  counting  the  number  of  refresh  traces  which 
corresponded  with  the  selected  stimulus  duration  (150,  400, 
or  800  ms) . 

The  experimental  conditions  are  shown  in  table  3.  The 
mnemonics  in  each  table  cell  describe  the  trial-block  condi- 
tions. The  three  trial-block  conditions  which  contained 
elements  which  differed  in  reliability  across  spatial  posi- 
tions are  denoted  by  the  letter  U  in  the  mnemonic,  meaning 
the  sources  were  unequal  in  reliability.  The  equal  reli- 
ability condition  is  denoted  by  the  letter  E  in  the  mnemon- 
ic.  As  demonstrated  in  the  table  all  trial-block  conditions 
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Table  3. 
The  mnemonics  for  experimental  conditions  found  in 
experiment  1. 


UNCUED 
150    400 

800 

CUED 
150    400 

800 

MIXED  BLOCK 

LEFT 

UNcM 

UCM 

RIGHT 

ODD 

EVEN 

PURE  BLOCK 

LEFT 

UNcP 

RIGHT 

ODD 

EVEN 

EQUAL  RELIABILITY 

ENcP 
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were  run  at  the  three  levels  of  stimulus  duration.  In 
addition,  within  each  of  the  unequal  reliability  trial-block 
conditions  the  four  element  reliability  arrangements  were 
presented. 

The  next  two  letters  in  the  mnemonics  represent  whether 
or  not  a  luminance  cue  was  present  (C  =  cue  and  No  =  No 
cue) .  Finally,  the  last  letter,  P  or  M,  denotes  the  manner 
in  which  the  element  reliability  arrangements  were  present- 
ed. The  P  represents  a  pure  block  design  in  which  the 
arrangements  remained  constant  across  experimental  trials  in 
a  given  block,  and  the  M  represents  a  mixed  block  design  in 
which  the  arrangements  varied  across  trials.  Thus,  the 
mnemonic  UNcM  stands  for  an  Unequal  reliability  No  cue  Mixed 
block  design. 
Procedure 

Observers  were  told  to  make  their  decisions  based  on  the 
level  of  the  gauges  relative  to  the  signal  and  noise  mean 
markers.  They  were  told  to  rank  the  likelihood  that  the 
evidence  represented  a  signal  by  using  the  "4",  "3",  "2"  and 
"1"  keys,  where  "4"  represented  very  sure  it  was  a  signal 
and  "1"  represented  noise.  In  fact,  observers  tended  only 
to  use  the  two  middle  keys.  Thus,  responses  on  keys  "1"  and 
"2"  were  combined  to  represent  noise,  and  responses  on  keys 
"3"  and  "4"  responses  were  combined  to  represent  signal  in 
the  data  analyses.  On  conditions  where  the  reliabilities 
differed  across  elements,  observers  were  informed  that  the 
least  variable  gauges  were  the  most  reliable. 


35 

The  trial  sequence,  shown  in  figure  4,  proceeded  as 
follows.  First,  observers  were  given  a  0.5"  by  0.5"  fixa- 
tion cross  at  the  center  of  the  display  for  200  ms .  This 
was  replaced  by  the  nine  line-graph  gauges  for  a  stimulus 
duration  of  either  150,  400,  or  800  ms.  Following  the 
stimulus  a  white  blanking  mask  was  presented  for  200  ms. 
Then,   the   display  was  completely  black  for   1   second,   at 

which  time  the  observers  were  allowed  to  respond.  Any 
responses  made  prior  to  or  following  this  period  were  dis- 
carded as  "No  Response"  trials.  Finally,  the  observers  were 
given  feedback  at  the  center  of  the  display  for  2  50  ms. 
Within  a  given  session,  an  observer  ran  through  10  blocks  of 
200  trials.  Across  sessions  there  were  1500  trials  (750 
signal  and  750  noise)  collected  for  each  condition. 

Due  to  time  constraints  imposed  by  the  need  to  collect 
multiple  trials,  some  of  the  observers  received  less  prac- 
tice than  others.  Subject  S4  was  highly  practiced.  He  ran 
through  at  least  eight  practice  sessions  for  each  condition 
prior  to  collection  of  the  experimental  trials.  Subjects 
SI,  S2,  and  S3  were  highly  practiced  on  the  Yes/No  detection 
task,  but  they  only  ran  through  one  practice  session  for 
each  of  the  individual  conditions. 

To  control  for  any  possible  practice  effects  in  the 
experimental  sessions,  each  observer  received  a  different 
order  of  four  trial  block  conditions,  organized  such  that 
across  subjects  each  condition  occurred  once  in  each  the 
four  possible  positions  in  the  order. 


36 


Fixation: 
200  ms 

-\- 

Stimulus 

Duration: 

150,400  or  800  ms 

~ 

— 

Mask: 
200  ms 

White   Field 

Response 
Duration: 
1000  ms 

Black   Field 

Feedback: 
250  ms 

Correct 

Figure  4.   Trial  sequence  for  the  first  experiment, 
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Results 

Average  observer  performance  measures  (d'^^^)  for  the 
experimental  conditions  are  shown  in  table  4 .  In  order  to 
consider  differences  between  the  equal  and  unequal  reliabil- 
ity block-type  conditions  the  data  were  collapsed  across 
source  reliability  arrangements  for  the  unequal  reliability 
conditions.  An  analysis  of  variance  performed  on  the  aver- 
age d '  ^^g  showed  a  significant  main  effect  of  stimulus 
duration  (F (2 , 6) =12 . 49 ,  p  <  0.01).  Performance  improved  as 
stimulus  duration  increased.  There  was  also  a  main  effect 
of  block-type  condition  (F (3 , 9) =5 . 285 ,  p  <  0.05).  All  four 
observers  showed  greater  performance  in  the  equal  reliabili- 
ty condition  relative  to  the  two  unequal  reliability  condi- 
tions which  did  not  include  a  luminance  cue  to  the  more 
reliable  sources  (UNcM  and  UNcP) . 

An  analysis  performed  on  the  observers  d '  ^j^^  measures  for 
the  unequal  reliability  conditions  indicated  significant 
effects  for  all  of  the  experimental  variables  (block-type 
condition,  stimulus  duration,  and  arrangement) ,  and  their 
interactions,  except  for  the  three-way  interaction.  Howev- 
er, only  a  few  of  these  differences  were  evident  in  the  data 
of  the  individual  subjects.  All  observers  showed  a  perform- 
ance improvement  as  stimulus  duration  increased 
(F(3 , 6) =13 . 66 ,  p<0.01).  There  was  also  a  performance  advan- 
tage for  the  cued  block-type  condition,  UCM,  relative  to  the 
UNcM  and  UNcP  conditions  (F (3 , 9) =3 . 86 ,  p<0.05). 


38 


Table  4. 
Average  observer  performance  (d')   for  each  experimental 
condition. 


I 
150 

JNCUED 
400 

800 

CUED 
150    400 

800 

MIXED  BLOCK 

LEFT 

1.92 

2.11 

2.12 

2.07 

2.37 

2.40 

RIGHT 

1.77 

1.89 

2.08 

1.90 

2.21 

2.34 

ODD 

2.06 

2.22 

2.28 

1.92 

2.06 

2.14 

EVEN 

1.93 

2.02 

2.18 

2.01 

2.20 

2.23 

PURE  BLOCK 

LEFT 

1.77 

2.05 

2.14 

RIGHT 

1.68 

1.92 

2.04 

ODD 

1.94 

2.07 

2.06 

EVEN 

1.74 

1.98 

2.03 

EQUAL  RELIABILITY 

2.24 

2.44 

2.57 

X  4  OBSERVERS 
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Finally,  there  was  consistency  among  the  observers  for 
two  of  the  interactions,  as  well.  For  the  four  arrangements 
there  was  a  tendency  for  observer  performance  to  be  highest 
when  the  most  reliable  sources  were  grouped  in  the  four  left 
spatial  positions  at  stimulus  durations  of  400  ms  or 
greater.  Alternatively,  at  the  shortest  stimulus  duration 
performance  was  highest  when  the  most  reliable  sources  were 
distributed  among  the  odd  spatial  positions.  The  effects  of 
arrangement  also  depended  on  the  particular  condition.  In 
general,  observers  showed  a  performance  advantage  for  the 
Odd  arrangement  over  the  other  three  arrangements  in  the 
two,  no  luminance  cue  conditions  (UNcM  and  UNcP) .  Alterna- 
tively, the  grouped  left  arrangement  yielded  the  greatest 
observer  performance  in  the  mixed  block,  cued  condition 
(UCM) .  Moreover,  the  right  arrangement  tended  to  show  the 
poorest  performance  in  the  UNcM  and  UNcP  conditions,  but 
relatively  high  performance  in  the  UCM  condition.  All  of 
these  differences  were  found  to  be  significant  through 
subsequent  paired  comparisons  using  a  Tukey  test. 

Thus,  the  evidence  from  these  analyses  indicated  that 
stimulus  duration  and  block-type  condition  had  a  consistent 
effect  on  observer's  performance.  In  addition,  the  arrange- 
ment of  source  reliabilities  influenced  observers'  perform- 
ance. However,  the  direction  of  effects  depended  on  the 
stimulus  duration  and  the  block-type  condition.  Performance 
was  greatest  when  the  stimulus  duration  was  at  least  400  ms 
and   sources   were  equal  in   reliability.    Considering   the 
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unequal  reliability  patterns  alone,  performance  was  best 
when  sources  high  in  reliability  were  cued  and  grouped. 
However,  performance  was  also  relatively  high  for  the  odd 
arrangement  in  the  UNcM  condition.  Since  the  location  of 
reliable  elements  in  the  visual  field  affected  performance 
differently  under  specific  conditions,  it  is  logical  to 
suspect  that  the  differences  in  performance  are  related  to 
observers'  weighting  strategies.  For  example,  when  observ- 
ers are  under  time  constraints  or  there  is  uncertainty  about 
the  location  of  reliable  sources,  observers  may  be  less 
efficient  at  applying  weights  appropriate  to  the  weighting 
strategy  selected. 
Weight  Analysis 

Observers'  weights  were  estimated  using  Berg's  (1989, 
1990)  Conditional-On-A-Single-Stimulus  (COSS)  analysis 
technique,  described  earlier.  The  estimated  weights  were 
based  on  the  slopes  of  cumulative  normal  functions  that  had 
the  best  Chi-Square  fit  with  the  corresponding  COSS  func- 
tions. Two  weight  estimates  were  calculated  for  each  ele- 
ment in  each  condition,  one  for  signal  and  one  for  noise 
trials.  In  this  analysis,  out  of  2808  COSS  functions  Chi- 
square  fitted  to  cumulative  normals,  6.7%  significantly 
differed  (p  <=  0.05)  from  the  observers'  COSS  functions. 
The  weights  reported  are  the  average  of  the  signal  and  noise 
weight  estimates. 

Table  5  lists  the  mean  weighting  efficiencies  derived 
from   the   weight  estimates  of  the  conditions  found   in   the 


41 


Table  5. 
Observer  Weighting  Efficiency  Estimates  for  Stimulus 
Duration  and  Condition. 


Stimul 

us  Durati 

on 

Subject 

Condition 

150 

400 

800 

Average 

SI: 

ENcP 

0.650 

0.780 

0.850 

0.760 

UNcM 

0.540 

0.645 

0.683 

0.622 

UNcP 

0.675 

0.755 

0.797 

0.743 

UCM 

0.735 

0.835 

0.850 

0.807 

S2: 

ENcP 

0.720 

0.820 

0.790 

0.777 

UNcM 

0.485 

0.563 

0.633 

0.560 

UNcP 

0.510 

0.617 

0.627 

0.585 

UCM 

0,607 

0.740 

0.795 

0.714 

S3: 

ENcP 

0.710 

0.830 

0.860 

0.800 

UNcM 

0.657 

0.688 

0.705 

0.683 

UNcP 

0.495 

0.650 

0.680 

0,608 

UCM 

0.715 

0.758 

0.802 

0.758 

S4: 

ENcP 

0.940 

0.960 

0.970 

0.957 

UNcM 

0.792 

0.745 

0.780 

0,772 

UNcP 

0.727 

0.770 

0.740 

0.746 

UCM 

0.900 

0.958 

0.960 

0.939 

Avg: 

ENcP 

0.750 

0.848 

0.868 

0.823 

UNcM 

0.619 

0.660 

0.700 

0.660 

UNcP 

0.602 

0.698 

0.711 

0.670 

UCM 

0.739 

0.822 

0.852 

0.805 

Equal  reliabilty  No  cue  Pure  block  design  (ENcP) 
Unequal  reliability  No  cue  Mixed  block  design  (UNcM) 
Unequal  reliability  No  cue  Pure  block  design  (UNcP) 
Unequal  reliability  Cue  Mixed  block  design  (UCM) 
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first  ANOVA  described  earlier.  Again,  for  the  three  unequal 
reliability  conditions  the  efficiencies  represent  the  aver- 
age of  the  four  arrangements.  Across  all  observers  and 
conditions,  weighting  efficiency  increased  as  the  stimulus 
duration  increased.  To  identify  whether  these  differences 
were  significant,  a  Monte  Carlo  simulation  was  run  to  esti- 
mate the  expected  variance  for  rj^  ^.  The  sampling  distribu- 
tion of  r^^g^  which  best  reflected  the  observers  weighting 
efficiencies  was  used  to  identify  significant  differences. 

The  criterion  selected  for  significant  differences  was 
two  standard  deviations  (c^^j^  =  0.04)  of  this  sampling 
distribution.  Differences  in  weighting  efficiency  which 
exceeded  0.08  were  identified  as  significant  differences. 
This  was  a  fairly  conservative  estimate,  considering  that 
these  distribution  parameters  corresponded  best  with  the 
data  from  the  poorest  observer.  Given  this  criterion, 
weighting  efficiency  was  significantly  greater  for  stimulus 
durations  of  400  ms  or  higher,  and  efficiency  was  highest  at 
800  ms.  Thus,  significant  improvements  in  weighting  effi- 
ciency at  least  partially  account  for  the  improvement  in 
overall  accuracy  found  when  stimulus  duration  increased. 

Moreover,  differences  among  the  observers'  weighting 
efficiencies  for  the  block-type  conditions  are  consistent 
with  the  differences  observed  in  the  overall  d'  ^  measures. 

ODS 

Performance  was  greatest  in  the  ENcP  and  UCM  conditions. 
Both  show  a  significant  advantage  over  the  other  two  unequal 
reliability   conditions   (UNcM  and  UNcP) .   This   pattern   is 
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maintained   at  all  stimulus  durations  and  fairly   consistent 
across  the  four  observers. 

Since  performance  was  highest  at  the  longest  stimulus 
duration,  and  differences  among  the  conditions  were  consist- 
ent across  all  levels  of  stimulus  duration,  the  figures 
depicting  the  separate  observers'  weights  for  the  unequal 
reliability  conditions  are  based  on  the  data  obtained  at  the 
800  ms  stimulus  duration,  only.  Figures  5-16  show  the 
observers'  average  weights  for  the  four  arrangements  of 
source  reliability  and  the  three  unequal  reliability  block- 
type  conditions.  There  are  three  figures  representing  the 
three  block-type  conditions  (UNcM,  UNcP,  and  UCM)  for  each 
observer.  In  each  figure  there  are  four  graphs  representing 
the  four  source  reliability  arrangements,  where  a,  b,  c,  and 
d  represent  the  left,  right,  even,  and  odd  arrangements, 
respectively.  The  larger  symbols  are  the  average  weight 
estimates  and  the  smaller  symbols  are  the  signal  and  noise 
weight  estimates.  The  solid  line  represents  the  optimal 
weights  for  the  separate  arrangements.  All  four  observers 
show  similar  changes  in  their  weights  across  the  three 
conditions,  where  their  weights  best  match  the  ideal  weights 
for  the  UCM  condition. 

Table  6  lists  the  observers'  weighting  efficiencies  for 
the  conditions  found  in  figures  5-16.  Again,  only  the  800 
ms  duration  is  shown.  Table  7  shows  the  average  weighting 
efficiencies  for  stimulus  duration,  arrangement,  and  block- 
type  condition.   As  with  the  average  data,  the  UCM  condition 
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Figure  5.  Subject  Si's  average  weights  for  the  four  source 
reliability  arrangements  in  the  UNcM  block-type  condi- 
tion. Panels  a,  b,  c,  and  d  are  the  left,  right,  even 
and  odd  arrangements,  respectively.  The  smaller  symbols 
are  the  separate  signal  and  noise  weight  estimates,  and 
the  solid  lines  represent  the  optimal  weights. 
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Figure  6.   Subject  Si's  average  weights  for  the  four  source 
reliability  arrangements  in  the  UNcP  block-type  condi- 
tion.   Panels  a,  b,  c,  and  d  are  the  left,  right,   even 
and  odd  arrangements,  respectively.   The  smaller  symbols 
are  the  separate  signal  and  noise  weight  estimates,   and 
the  solid  lines  represent  the  optimal  weights. 
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Figure  7.   Subject  Si's  average  weights  for  the  four  source 
reliability  arrangements  in  the  UCM  block-type   condi- 
tion.  Panels  a,  b,  c,  and  d  are  the  left,  right,  even 
and  odd  arrangements,  respectively.   The  smaller  symbols 
are  the  separate  signal  and  noise  weight  estimates,   and 
the  solid  lines  represent  the  optimal  weights. 
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Figure  8.  Subject  S2's  average  weights  for  the  four  source 
reliability  arrangements  in  the  UNcM  block-type  condi- 
tion. Panels  a,  b,  c,  and  d  are  the  left,  right,  even 
and  odd  arrangements,  respectively.  The  smaller  symbols 
are  the  separate  signal  and  noise  weight  estimates,  and 
the  solid  lines  represent  the  optimal  weights. 
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Figure  9.   Subject  S2's  average  weights  for  the  four  source 
reliability  arrangements  in  the  UNcP  block-type  condi 
tion.    Panels  a,  b,  c,  and  d  are  the  left,  right,   even 
and  odd  arrangements,  respectively.   The  smaller  symbols 
are  the  separate  signal  and  noise  weight  estimates,   and 
the  solid  lines  represent  the  optimal  weights. 
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Figure  10.  Subject  S2's  average  weights  for  the  four  source 
reliability  arrangements  in  the  UCM  block-type  condi- 
tion. Panels  a,  b,  c,  and  d  are  the  left,  right,  even 
and  odd  arrangements,  respectively.  The  smaller  symbols 
are  the  separate  signal  and  noise  weight  estimates,  and 
the  solid  lines  represent  the  optimal  weights. 
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Figure  11.  Subject  S3's  average  weights  for  the  four  source 
reliability  arrangements  in  the  UNcM  block-type  condi- 
tion. Panels  a,  b,  c,  and  d  are  the  left,  right,  even 
and  odd  arrangements,  respectively.  The  smaller  symbols 
are  the  separate  signal  and  noise  weight  estimates,  and 
the  solid  lines  represent  the  optimal  weights. 
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Figure  12,  Subject  S3's  average  weights  for  the  four  source 
reliability  arrangements  in  the  UNcP  block-type  condi 
tion.  Panels  a,  b,  c,  and  d  are  the  left,  right,  even 
and  odd  arrangements,  respectively.  The  smaller  symbols 
are  the  separate  signal  and  noise  weight  estimates,  and 
the  solid  lines  represent  the  optimal  weights. 
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Figure  13.  Subject  S3's  average  weights  for  the  four  source 
reliability  arrangements  in  the  UCM  block-type  condi- 
tion. Panels  a,  b,  c,  and  d  are  the  left,  right,  even 
and  odd  arrangements,  respectively.  The  smaller  symbols 
are  the  separate  signal  and  noise  weight  estimates,  and 
the  solid  lines  represent  the  optimal  weights. 
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Figure  14.  Subject  S4's  average  weights  for  the  four  source 
reliability  arrangements  in  the  UNcM  block-type  condi- 
tion. Panels  a,  b,  c,  and  d  are  the  left,  right,  even 
and  odd  arrangements,  respectively.  The  smaller  symbols 
are  the  separate  signal  and  noise  weight  estimates,  and 
the  solid  lines  represent  the  optimal  weights. 
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Figure  15.  Subject  S4's  average  weights  for  the  four  source 
reliability  arrangements  in  the  UNcP  block-type  condi 
tion.  Panels  a,  b,  c,  and  d  are  the  left,  right,  even 
and  odd  arrangements,  respectively.  The  smaller  symbols 
are  the  separate  signal  and  noise  weight  estimates,  and 
the  solid  lines  represent  the  optimal  weights. 
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Figure  16.  Subject  S4's  average  weights  for  the  four  source 
reliability  arrangements  in  the  UCM  block-type  condi- 
tion. Panels  a,  b,  c,  and  d  are  the  left,  right,  even 
and  odd  arrangements,  respectively.  The  smaller  symbols 
are  the  separate  signal  and  noise  weight  estimates,  and 
the  solid  lines  represent  the  optimal  weights. 
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Table  6. 
Weighting  Efficiency  Estimates  for  Arrangement  and 
the  Unequal  Reliability  Trial-Block  Conditions. 


Subject 

Arrangement 

UNcM 

UNcP 

UCM 

Si: 

Left 

0,740 

0.880 

0.940 

Right 

0.580 

0.820 

0.940 

Even 

0.610 

0.690 

0.790 

Odd 

0.800 

0.800 

0.730 

S2: 

Left 

0.650 

0.620 

0.860 

Right 

0.560 

0.490 

0.850 

Even 

0.630 

0.640 

0.780 

Odd 

0.690 

0.760 

0.690 

S3: 

Left 

0.700 

0.700 

0.770 

Right 

0.570 

0.570 

0.780 

Even 

0.760 

0.750 

0.810 

Odd 

0.790 

0.700 

0.850 

S4: 

Left 

0.710 

0.840 

0.970 

Right 

0.780 

0.730 

0.970 

Even 

0.730 

0.600 

0.950 

Odd 

0.900 

0.790 

0.950 

Avg: 

Left 

0.700 

0.760 

0.885 

Right 

0.623 

0.653 

0.885 

Even 

0.683 

0.670 

0.833 

Odd 

0.795 

0.763 

0.805 

Equal  reliabilty  No  cue  Pure  block  design  (ENcP) 
Unequal  reliability  No  cue  Mixed  block  design  (UNcM) 
Unequal  reliability  No  cue  Pure  block  design  (UNcP) 
Unequal  reliability  Cue  Mixed  block  design  (UCM) 
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Table  7. 
Average  weighting  efficiency  estimates  for  the 
experimental   variables  (block-type  condition, 
arrangement  and  stimulus  duration) . 


Stimulus 

Duration 

Arrangement 

UNcM 

UNcP 

UCM 

150 

Left 

0.640 

0.633 

0.823 

Right 

0.520 

0.545 

0.757 

Even 

0.643 

0.620 

0.743 

Odd 

0.673 

0.610 

0.635 

400 

Left 

0.668 

0.730 

0.845 

Right 

0,585 

0.675 

0.890 

Even 

0.660 

0.658 

0.815 

Odd 

0.728 

0.730 

0.740 

800 

Left 

0.700 

0.760 

0.885 

Right 

0.623 

0.653 

0.885 

Even 

0.683 

0.670 

0.833 

Odd 

0.795 

0.763 

0.805 

AVG 

Left 

0.669 

0.708 

0.851 

Right 

0.576 

0.624 

0.844 

Even 

0.662 

0.649 

0.797 

Odd 

0.732 

0.701 

0.727 

Equal  reliabilty  No  cue  Pure  block  design  (ENcP) 
Unequal  reliability  No  cue  Mixed  block  design  (UNcM) 
Unequal  reliability  No  cue  Pure  block  design  (UNcP) 
Unequal  reliability  Cue  Mixed  block  design  (UCM) 
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shows  higher  weighting  efficiency  than  the  other  two  condi- 
tions, UNcM  and  UNcP,  for  each  arrangement  and  stimulus 
duration,  except  when  the  most  reliable  elements  were  found 
in  the  Odd  positions.  Table  6  indicates  that  all  four 
observer  show  the  same  pattern  of  effects  at  the  800  ms 
stimulus  duration.  They  tended  to  show  a  little  more  varia- 
bility, but  similar  patterns  across  subjects  at  the  two 
shorter  stimulus  durations. 

As  with  the  d'^^^^  data  we  see  a  slight,  but  not  signifi- 
cant, advantage  for  grouped  arrangements  in  the  UCM  condi- 
tion, and  for  the  odd  arrangement  in  the  UNcM  condition 
relative  to  the  other  arrangements.  The  interaction  between 
stimulus  duration  and  arrangement  found  in  the  d '  ^j^g  data 
was  not  supported  by  differences  in  observers  weighting 
efficiencies. 

From  table  6  we  also  see  that  the  less  practiced  observ- 
ers (SI,  S2,  and  S3)  tend  to  show  the  lowest  weighting 
efficiency  in  the  two  non-cued  conditions,  UNcM  and  UNcP, 
for  the  right  arrangement.  The  same  is  true  for  the  shorter 
durations.  However,  once  the  cue  was  introduced,  efficiency 
was  indistinguishable  for  the  weighting  efficiency  for  the 
left  arrangement  which  was  consistently  fairly  high. 
Finally,  as  far  as  the  observers'  residual  efficiency,  H^^^i 
were  concerned,  the  only  consistent  difference  among  the 
observers  was  a  tendency  for  residual  efficiency  to  be 
higher  for  the  equal  reliability  condition  relative  to  the 
three  unequal  reliability  block-type  conditions.   The  stimu- 
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lus  duration  by  arrangement  interaction  found  in  the  observ- 
ers' d '  ^^jg  measures  was  not  driven  exclusively  by  changes  in 
ri        or   T7  .  ,  but  rather  by  a  combination  of   these   ef- 

'wgt        'noise'  -' 

fects. 

Discussion 

The  primary  goal  of  this  investigation  was  to  determine 
the  effects  of  selected  display  factors  in  directing  observ- 
ers' attention  to  informative  sources.  There  was  an  overall 
improvement  in  observer  performance  as  stimulus  duration 
increased;  this  was  mainly  a  function  of  improved  weighting 
efficiency.  In  general,  when  no  additional  cues  to  source 
reliability  were  available,  weighting  efficiency  was  great- 
est when  sources  were  equal  rather  than  unequal  in  reliabil- 
ity. There  was  a  tendency  for  better  performance  when  more 
reliable  sources  were  grouped,  rather  than  distributed  in 
the  cued  block-type  condition,  especially  when  the  stimulus 
duration  was  at  least  400  ms.  Alternatively,  in  the  no  cue 
block-type  conditions  performance  was  highest  when  observers 
were  presented  a  distributed  odd  arrangement.  These  differ- 
ences in  performance  were  mostly  due  to  differences  found 
in  the  observers'  weighting  efficiency  measures. 

When  sources  have  to  be  prioritized  in  terms  of  the 
underlying  statistical  properties  of  the  information,  as  in 
this  study,  people  are  limited  by  both  their  ability  to 
estimate  these  properties  (e.g.,  variability  of  the  source) 
and  by  their  ability  to  then  weight  the  sources  accordingly. 
The   observers'  relatively  poor  performance  in  the  non-cued 
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conditions  may  have  been  due  to  their  inability  to  estimate 
the  variability  of  the  sources  when  this  information  was 
relevant  to  their  decisions  (e.g.,  the  UNcP  condition). 
That  is,  the  observers  may  not  have  been  sensitive  to  the 
trial-to-trial  variability  of  the  sources.  The  improvement 
in  weighting  efficiency,  given  the  luminance  cue  in  the  UCM 
condition,  indicates  greater  attention  or  weight  directed 
toward  these  elements.  It  does  not  necessarily  suggest  that 
observers  have  improved  sensitivity  to  the  differences  in 
element  reliability.  To  test  this  possibility,  observer  S4, 
contributed  data  to  an  additional  condition  in  which  gauge 
luminance  was  inversely  related  to  gauge  reliability. 

Figure   17  shows  subject  S4's  average   weight   estimates 
for  the  two  luminance  cue  conditions,  UCM  and  reverse   cue, 
and  the  four  reliability  patterns,  Grouped-Left  and   -Right 
and   Distributed-Even   and  -Odd.   The   circles   represent   a 
direct  relationship  between  gauge  luminance  and  reliability, 
the   UCM  condition.  The  triangles  represent  the  reverse   cue 
condition.    The  weights  estimated  for  these  two   conditions 
are   nearly  identical.   This  was  reflected  in  the  weighting 
efficiencies.   The  largest  difference  between  the  two   lumi- 
nance cue  conditions  in  weighting  efficiency  was  0.02;  for 
the  Grouped-Left  pattern  f]^  ^  =  0.97  for  the  UCM  condition, 
and  ri       =   0.9  5  for  the  reverse  cue  condition.   Thus,  at 
least  on  the  initial  trials  in  a  given  block  the  observer 
had  to  be  sensitive  to  differences  in  element  variability  to 
detect  which  elements  were  more  reliable. 
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Figure  17.   The  average  weights  for  subject  S4  in  the  UCM 
and  reversed  cue  conditions. 
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However,  there  remains  a  question  as  to  whether  or  not 
observers  were  actually  using  trial-to-trial  variability  of 
the  sources  to  make  their  decisions.  Some  of  the  observers 
showed  a  fair  amount  of  accuracy  in  weighting  sources 
according  to  reliability  in  the  UNcM  condition  where  trial- 
to-trial  variability  could  not  be  used  to  identify  which 
sources  were  more  reliable.  One  possible  explanation  for 
this  performance  is  that  observers  were  sensitive  to  pat- 
terns produced  by  gauge  markers  when  the  variability  was 
low. 

On  a  given  trial,  the  markers  of  the  high  reliability 
gauges  tended  to  fall  at  a  common  vertical  position  in  the 
display,  causing  them  to  line  up  as  shown  in  figure  18. 
Thus,  this  pattern  may  have  drawn  observers  attention, 
helping  them  to  identify  which  sources  were  more  reliable. 
After  questioning  the  observers  about  strategies  used  on 
these  trials,  it  was  confirmed  that  observers  were  sensitive 
to  such  patterns  in  the  display.  Based  on  this  evidence 
alone  it  is  not  conclusive  that  observers  were  sensitive  to 
the  underlying  variability  of  the  sources.  Rather  it  is 
more  probable  that  cues  provided  and  the  patterns  inherent 
in  displays  helped  observers  to  weight  sources  according  to 
reliability. 

Finally,  the  especially  good  performance  for  the  odd 
arrangement  raises  another  question.  Why  is  performance  so 
good  for  the  odd  arrangement  in  the  non-cued  conditions? 
This   advantage  may  be  due  to  the  unique  characteristics   of 
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Grouped-Left  High 


Grouped-Right  High 


Distributed-Even  High 


Distributed-Odd  High 


Figure  18.  Demonstration  of  the  possible  patterns  for 
four  arrangements  which  observers  may  have  used  to 
identify  the  more  reliable  sources. 
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the  arrangement.  The  odd  arrangement  was  the  only  condition 
with  five  sources,  rather  than  four,  which  were  high  in 
reliability.  As  a  result,  it  had  more  high  reliability 
sources  distributed  throughout  the  visual  field,  and  it  was 
the  only  arrangement  that  had  a  source  high  in  reliability 
located  at  the  fixation  point.  Secondly,  in  order  to  main- 
tain equal  levels  of  predicted  ideal  performance,  there 
were  smaller  differences  between  the  variances  for  high  and 
low  reliability  sources.  Thus,  based  on  both  of  these 
factors,  this  condition  most  closely  approximated  an  equal 
reliability  condition.  If  observers  resorted  to  weighting 
sources  equally  when  they  are  under  time  stress  or  unable  to 
identify  which  sources  are  reliable,  this  strategy  would 
prove  most  useful  in  the  odd  condition. 

In  conclusion,  from  this  study  we  can  state  that  when 
observers  have  to  utilize  information  from  multiple  subordi- 
nate displays  they  are  relatively  inefficient  at  identify- 
ing differences  in  reliability  among  the  displays.  However, 
there  is  improved  efficiency  when  the  display  elements  are 
coded  by  luminance.  The  assumption  is  that  this  cue,  and 
possibly  other  cues,  help  observer's  to  prioritize  the 
information  by  indicating  where  attention  should  be  direct- 
ed. Additional  assistance  may  be  gained  by  organizing  the 
displays  such  that  sources  similar  in  reliability  are  proxi- 
mate to  one  another. 


EXPERIMENT  2 
Introduction 

This  study  continues  the  investigation  of  observers' 
ability  to  use  independent  visual  information  sources  in 
forming  detection  decisions.  Since  humans  are  often 
required  to  make  decisions  under  time  stress  in  many  real 
world  settings,  researchers  have  been  interested  in  identi- 
fying means  of  coding  visual  information  to  reduce  potential 
errors  and  optimize  human  performance.  One  approach  is  to 
assist  performance  by  creating  display  codes  which  capital- 
ize on  our  knowledge  of  human  sensory  and  perceptual  mecha- 
nisms. For  instance,  Woods,  Wise,  and  Hanes  (1981)  reduced 
the  complexity  of  integrating  multiple  independent  sensor 
values  to  form  a  detection  decision  by  combining  display 
elements  into  a  single  geometric  form.  This  allowed  human 
monitors  to  use  shape  distortions  to  identify  important 
system  states. 

The  primary  concern  of  this  study  was  to  determine 
whether  two  factors  related  to  display  element  arrangement 
affect  observers'  detection  decisions.  The  first  factor 
concerns  the  influence  of  emergent  features  on  observers' 
detection  decisions.  Emergent  features  are  defined  as 
properties  that  arise  from  the  configuration  of  "simple" 
elements  that  are  not  identifiable  in  any  given  element 
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(Treisman,  1986) .  For  instance,  if  the  elements  are  repre- 
sented by  three  line  segments,  then  depending  on  the 
arrangement  chosen  we  could  create  one  of  the  forms  shown  in 
figure  19.  Particular  element  arrangements  produce  features 
such  as  angles  and  intersections  which  are  not  observable 
given  the  individual  lines.  Moreover,  some  element  arrange- 
ments produce  global  features  which  are  recognizable 
objects.  For  instance,  the  first  and  last  element  arrange- 
ments in  Figure  19  do  not  possess  as  strong  an  object  quali- 
ty as  the  triangle  found  in  the  middle. 

There  is  mixed  evidence  in  the  object  perception  litera- 
ture regarding  whether  emergent,  object-like,  features  of 
simple  element  arrangements  facilitate  or  hinder  detection 
of  the  underlying  elements.  Some  studies  have  found  evi- 
dence suggesting  an  "object-superiority  effect."  That  is, 
when  a  target  feature  (e.g.  a  line  segment  of  a  given  orien- 
tation) is  embedded  in  a  contextual  pattern,  observer  detec- 
tion performance  is  facilitated  when  the  target  feature  and 
context  form  a  recognizable  object  (Weisstein  &  Harris, 
1974).  Similarly,  Ankrum  and  Palmer  (1991)  found  that 
observers  were  better  at  detecting  differences  between  two 
element  arrangements  which  formed  objects  than  between 
element  arrangements  in  which  one  was  an  object  and  the 
other  was  part  of  an  object.  This  enhanced  detectability 
may  be  related  to  the  familiarity  of  the  organized  pattern 
(Purcell  &  Stewart;  1991),  similar  to  the  "word   superiority 
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Figure  19.   Example  of  three  line-graph  arrangements 

demonstrating  an  emergent  feature.   The  middle  figure 
produces  an  emergent  object-like  feature. 
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effect"  where  it  is  easier  to  detect  a  specific  letter  in  a 
word  than  in  a  nonword  (Reicher,  1969) . 

Others  have  found  contradicting  evidence.  Pomerantz 
(1981)  points  to  evidence  suggesting  that  when  elements 
perceptually  group,  the  emergent  feature  created  by  the 
configuration  may  be  more  perceptually  salient  and  selective 
attention  to  the  underlying  elements  may  be  impeded  (Pomer- 
antz &  Schwaitzberg,  1975)  .  Similarly,  Navon  (1977)  found 
Stroop  interference  of  global  configurations  on  subjects' 
processing  of  local  elements,  but  not  the  opposite.  Bennett 
and  Flach  (1992)  summarize  the  results  from  a  number  of 
studies  which  have  applied  this  concept  to  real  world  set- 
tings. These  studies  indicated  that  an  emergent  object-like 
property  had  no  affect  on,  or  adversely  affected,  perform- 
ance in  tasks  requiring  focused  attention  to  the  individual 
elements.  However,  they  pointed  out  that  performance  had 
been  facilitated  by  the  same  emergent  features  when  the 
detection  task  required  information  integration. 

One  hypothesis,  tested  in  earlier  studies,  stated  that 
the  magnitude  of  the  performance  advantage  in  integration 
tasks,  relative  to  selective  attention  tasks,  depended  upon 
the  degree  to  which  the  element  configuration  produced  an 
object-like  feature  (Carswell  &  Wickens,  1987;  Wickens  & 
Andre,  1988)  .  The  degree  of  "objectness"  depended  on  wheth- 
er or  not  the  element  configuration  possessed  an  enclosed 
contour  (Wickens  &  Andre,  1990)  .  However,  later  evidence 
suggested   that  the  important  factor  was  whether  or   not   an 
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emergent  feature  carried  important  information  about  the 
underlying  state,  rather  than  simply  the  object  quality  of 
the  configuration.  For  instance,  Buttigieg  and  Sanderson 
(1991)  found  that  object  displays  did  not  always  produce  the 
best  performance  in  integration  tasks,  whereas  "well-mapped" 
emergent  features  did. 

However,  recent  investigations,  designed  specifically  to 
address  the  relationship  between  the  configuration  and  the 
task,  have  produced  mixed  results.  Some  researchers  have 
found  support,  suggesting  that  performance  was  better  when 
there  was  a  strong  relationship  between  some  property  of  the 
emergent  feature  and  the  decision  statistic  than  when  this 
relationship  was  weak  (Bennett,  Toms,  &  Woods,  1993;  Mitch- 
ell &  Biers,  1992;  Schmidt  &  Elvers,  1992).  Others 
researchers  did  not  find  a  performance  advantage  for  emer- 
gent features  that  were  "well-mapped"  to  the  decision  task 
(Sanderson,  Haskell,  &  Flach,  1992). 

The  second  factor  addressed  in  this  study  concerns  the 
importance  of  the  relationship  between  the  emergent  feature 
and  the  optimal  decision  statistic  for  the  task.  One  nice 
characteristic  of  the  Theory  of  Signal  Detectability  (TSD, 
Green  and  Swets,  19  66)  paradigm  is  that  we  can  mathemati- 
cally specify  the  optimal  decision  statistic  in  a  detection 
task.  The  optimal  decision  statistic  is  the  likelihood 
ratio  or  some  value  which  is  monotonically  related  to  the 
likelihood  ratio.  If  there  is  a  direct  relationship  between 
an  emergent  property  in  the  display  (e.g.,  size  or  area)  and 
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the   optimal   decision   statistic,  it  is   expected   that   the 
observers  can  use  changes  in  the  magnitude  of  this   emergent 
property  to  make  their  decisions. 

In  this  study,  observers  were  presented  four  independent 
informational  sources  coded  as  graphical  elements  in  a 
visual  display.  Six  display  codes  were  constructed  from  a 
combination  of  two  factors,  1)  whether  or  not  the  display 
element  arrangement  produced  an  emergent  feature,  and  2) 
whether  or  not  the  emergent  feature  had  some  property  that 
was  directly  related  to  the  optimal  decision  statistic  (for 
the  Yes/No  task) .  Observers  used  this  information  to  per- 
form either  a  Yes/No  task  or  a  4AFC  task. 

The   magnitude   of  a  given  source  was   determined  by   a 
normal   random  variable  which  depended   on   the   underlying 
state   (Signal   or  Noise) .   For  signal   events,   the   source 
values   were  selected  from  a  distribution  with  a  mean  of   u 
and  a  standard  deviation  of  a   .   The  values  of  noise  sources 

s 

were   drawn   from   a  distribution  with  a  mean  of   u  and   a 
standard   deviation  of  a  ,    where  a    <   a    and  o    =   a     =      a  . 

n'  '^n    '^s        s      n        e 

In  the  Yes/No  task,  observers  had  to  decide  whether  all 
source  values  presented  on  a  given  trial  represented  either 
a  signal  state  or  noise.  In  the  4AFC  task,  the  observer  had 
to  decide  which  of  four  elements  represented  the  signal 
event . 

If  emergent  features  facilitate  the  processing  of  the 
underlying  elements,  then  we  may  find  more  efficient  deci- 
sion making  performance  for  both  Yes/No  and   4AFC   decision 
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tasks  when  such  features  are  present.  Alternatively,  if 
these  emergent  features  are  processed  faster  than  the  under- 
lying elements,  then  decision  tasks  which  require  sensi- 
tivity to  the  underlying  elements  may  be  hindered  by  display 
codes  that  possess  these  features.  For  instance,  in  an 
equal  reliability  Yes/No  task,  sensitivity  to  the  separate 
underlying  elements  is  of  less  importance  to  performance 
than  it  is  in  a  4 AFC  task.  As  a  result,  if  the  emergent 
feature  hinders  observer  sensitivity  to  the  underlying 
elements,  performance  on  a  4AFC  task  may  be  less  efficient 
when  the  information  is  arranged  to  form  an  emergent  feature 
than  when  the  element  arrangement  does  not  posses  this 
feature. 

With  respect  to  the  second  factor,  if  the  relationship 
between  the  magnitude  of  some  emergent  feature  property  and 
the  optimal  decision  statistic  is  important  then  we  should 
see  a  performance  advantage  when  this  relationship  is 
present.  In  the  current  study,  this  relationship  was  coded 
only  for  the  Yes/No  decision  task.  Two  of  the  six  display 
arrangements  produced  an  emergent  feature  in  which  the  width 
or  the  area  of  this  feature  was  directly  related  to  the 
optimal  decision  statistic  for  the  Yes/No  task.  Thus,  it 
was  expected  that  detection  performance  would  be  facilitated 
in  the  Yes/No  task  given  these  two  display  codes  relative  to 
the  other  codes  which  do  not  possess  this  relationship. 

Finally,  the  object-like  quality  of  the  emergent  feature 
was  also  tested  in  this  study.   Some  of  the  element 
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arrangements  produced  emergent  features  which  possessed  an 
enclosed  contour,  whereas  others  did  not.  If  the  configural 
property  of  the  display  arrangement  is  an  important  factor 
(Carswell  &  Wickens,  1987) ,  then  display  arrangements  which 
possess  this  property  may  be  more  likely  to  show  the  expect- 
ed emergent  feature  effects  than  arrangements  without  an 
enclosed  contour. 

Method 
Subjects 

Three  of  the  four  subjects  who  participated  in  the  first 
study   also  contributed  data  in  this  study.    All   observers 
were  paid  an  hourly  wage  plus  a  bonus  based  on  performance. 
Apparatus  and  Stimuli 

Observers  were  seated  in  a  sound  isolated  booth  approxi- 
mately 27  inches  away  from  a  10.5  inch  color  monitor  (EGA). 
The  monitor  was  set  for  maximum  contrast.  Intensity  was  set 
at  approximately  100  cd/m'  measured  from  a  uniform  white 
field  covering  the  monitor.  On  a  given  trial,  one  of  six 
arrangements  of  four  graphical  elements  was  presented  on  the 
monitor  against  a  gray  grid.  The  maximum  horizontal  and 
vertical  visual  angles  were  13.5°  and  4.5°,  respectively. 
(The  separate  measures  of  visual  angle  for  each  of  the 
displays  are  found  on  figures  20  and  21.)  The  values 
depicted  by  the  graphical  elements  were  either  drawn  from  a 
signal  or  noise  distribution,  depending  on  the  trial  and 
task.   The  parameters  of  the  signal  and  noise   distributions 
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were   u  =  50  and  a  =  10  and  u  =  40  and  a„ =   10,   respec- 
tively.   Element  magnitude  ranged  from  1  to  99. 

Figures  20a  -  20c  depict  three  line-graph  display  codes 
in  which  element  magnitude  was  coded  by  the  length  (number 
of  pixels)  of  a  horizontal  line  segment.  Figure  20a  repre- 
sents the  linear  likelihood  (LIN-LR)  arrangement.  In  the 
LIN-LR  arrangement,  there  was  a  fixed  separation  between 
the  end  of  one  line  segment  and  the  beginning  of  the  next. 
Thus,  in  the  Yes/No  task  the  total  length  of  the  display 
produced  by  the  separate  segments  varied  directly  with  the 
likelihood  ratio.  Figures  20b  and  20c  represent  two  ver- 
sions of  the  linear  non-likelihood  arrangement.  In  both 
cases,  the  onset  of  each  line  segment  began  at  a  specified 
location  in  the  visual  field;  total  display  length  did  not 
vary.  In  one  case  the  elements  were  arranged  horizontally 
(LIN-NL) ,  and  in  the  other  case  they  were  arranged  in  a 
square  (LSQ-NL) ,  to  control  for  differences  in  visual  angle. 

Figures  21a  -  21c  depict  three  angular  displays.  The 
angle  formed  by  two  line  segments  in  a  given  quadrant  was 
directly  related  to  the  magnitude  of  the  underlying  element. 
One  end  of  each  line  segment  was  fixed  in  position  on  the 
display.  The  opposite  ends  of  the  two  segments  joined  to 
form  an  angle.  The  small  arrows  in  figures  21a  -  21c  desig- 
nate the  angles  being  described.  The  size  of  the  angle 
varied  with  element  magnitude  as  follows: 

Angle  =  270  -  2tan"^  (  ( lOO-x,.) /x.)  ,  (22) 

where   x,  is  the  magnitude  of  the  i*^^  element.    In   figure 
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a.)  Linear  Likelihood  (LIN-LR)  Arrangement 
Yes/No  Signal  Trial 


Yes/No  Noise  Trial 


Average  Visual  Angle:  Horizontal  =  6.75' 

Vertical    =  1.6° 


b.)  Linear  Non-Likelihood  Horizontal  (LIN-NL)  Arrangement 
Yes/No  Signal  Trial 


Yes/No  Noise  Trial 


Average  Visual  Angle:  Horizontal  =  11.25' 

Vertical   =   1.6° 


c.)  Linear  Non-Likelihood  Square  (LSQ-NL)  Arrangement 
Yes/No  Signal  Trial  Yes/No  Noise  Trial 


Average  Visual  Angle:  Horizontal  =  6.2° 

Vertical    =6.3° 


Figure  20,   Line  graph  display  codes.   Figures  a,  b  and  c 

are  the  Linear  Likelihood  (LIN-LR) ,  Linear  Non-Likeli- 
hood (LIN-NL) ,  and  Linear  Non-Likelihood  Square  (LSQ-NL) 
displays.  Each  display  was  presented  in  front  of  a  gray 
grid  and  the  visual  angle  subtended  is  listed  at  the 
bottom  of  each  figure. 
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a.)  Object  Likelihood  (OBJ-LR)  Arrangement 

Yes/No  Signal  trial  Yes/No  Noise  trial 


Average  Visual  Angle:  Horizontal 

Vertical 


6.3° 
6.3° 


b. )  Object  Non-Likelihood  (OBJ-NL)  Arrangement 

Yes/No  Signal  trial  Yes/No  Noise  trial 


)rs 


Average  Visual  Angle:  Horizontal  =  6.3° 

Vertical   =6.3° 

c.)  Angular  Non-Likelihood  (ANG-NL)  Arrangement 

Yes/No  Signal  trial  Yes/No  Noise  trial 


f^ 


Average  Visual  Angle:  Horizontal  =  6.3° 

Vertical    =  6.3° 

Figure  21.  Angular  display  codes.  Figures  a,  b  and  c  are 
the  Object  Likelihood  (OBJ-LR) ,  Object  Non-Likelihood 
(OBJ-NL) ,  and  Angular  Non-Likelihood  (ANG-NL)  displays. 
Each  display  was  presented  in  front  of  a  gray  grid  and 
the  visual  angle  subtended  is  listed  at  the  bottom  of 
each  figure. 
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21a,  the  smaller  element  values  correspond  with  angles   which 
point  toward  the  center  of  the  display.   In  figures  21b   and 
21c,   smaller  element  values  correspond  with   angles  which 
point  downward. 

In  two  of  the  figures  (21a  and  21b)  the  elements  were 
arranged  to  form  an  enclosed  contour,  producing  object-like 
shapes.  In  figure  21a  the  element  arrangement  produced  an 
object  in  which  the  area  was  directly  related  to  the  likeli- 
hood ratio  for  the  Yes/  No  task  (OBJ-LR) .   That  is, 

Area   =  SlOOXj.  (23) 

1 

Figure  21b  represents  an  object-like  display  that  does  not 
have  a  property  related  to  the  optimal  decision  statistic 
(OBJ-NL) .  Finally,  figure  21c  depicts  the  ANG-NL  display 
which  is  identical  to  the  OBJ-NL  display;  however,  it  does 
not  posses  a  continuous  enclosed  contour. 
Procedure 

For  the  Yes/No  task,  observers  were  told  to  make  their 
decisions  based  on  the  average  magnitude  of  the  gauges,  and 
to  rank  the  likelihood  that  the  evidence  represented  a 
signal  by  using  the  "4",  "3",  "2"  and  "1"  keys  of  a  101-key 
keyboard,  where  "4"  represented  "very  sure  it  was  a  signal" 
and  "1"  represented  "noise".  Again,  "1"  and  "2"  responses 
were  identified  as  noise,  and  "3"  and  "4"  responses  were 
identified  as  signals  in  the  analyses.  On  4AFC  task,  observ- 
ers were  told  to  identify  which  of  the  four  gauges  had  the 
greatest  magnitude,  and  thus,  represented  a   signal   event. 
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Subjects  indicated  the  signal  location  by  using  either  the 
same  keys  found  in  the  Yes/No  task,  or  the  "Ins",  "Home", 
"Del"  and  "End"  keys,  depending  on  the  element  arrangement. 

The  trial  sequence  was  the  same  as  the  first  study  (see 
Figure  4) ,  However,  instead  of  nine  line-graph  gauges,  one 
of  the  six  display  arrangements  described  above  were  pre- 
sented for  a  stimulus  duration  of  200  ms.  (The  stimulus 
duration  was  controlled  in  the  same  manner  as  described  in 
experiment  1.)  Each  observer  received  eight  blocks  of 
practice  for  each  of  the  display  arrangements  and  tasks 
before  the  experimental  data  was  collected.  For  both  the 
practice  and  experimental  trials,  the  display  arrangements 
were  randomly  presented  and  each  observer  received  a  differ- 
ent random  order.  In  a  given  session,  a  subject  ran  through 
eight  blocks  of  the  Yes/No  task  and  eight  blocks  of  the  4AFC 
task,  and  they  contributed  data  to  eight  blocks  of  one  task 
before  beginning  the  next  task.  The  order  of  the  tasks 
alternated  across  experimental  sessions. 

Since  performance  in  the  initial  experimental  blocks  was 
nearly  ideal,  a  random  noise  pattern  was  added  to  the  dis- 
plays to  degrade  performance.  The  random  noise  pattern 
consisted  of  750  white  spots,  two  pixels  in  width.  On  each 
trial,  the  locations  of  the  spots  were  randomly  determined. 
Thus,  the  pattern  varied  across  trials.  This  noise  pattern 
overlaid  the  graphical  elements  and  background  grid,  such 
that  the  extent  of  the  random  noise  was  confined  to  the 
vertical   and  horizontal  dimensions  of  the  background  grid. 
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Approximately   12%   of  the  grid  region  was   covered   by   the 

random  noise  pattern. 

Subjects  performance  in  the  Yes/No  task  was  poorer  with 

the   random  noise  pattern  than  without.   Overall   efficiency 

dropped   approximately  16%  when  the  random  noise  was   added. 

The  size  of  the  differences  increased.   However,  the  general 

pattern   of  effects  did  not  change.   Thus,  the   experimental 

trials  included  the  random  noise  pattern. 

Results 
Yes-No  Task 

The  observers'  accuracy  (d')  and  mean  reaction  time 
measures  for  the  six  display  arrangements  in  the  Yes/No  task 
are  shown  in  figures  22  and  23,  respectively.  Three  panels, 
a-c,  in  each  figure  represent  the  individual  subjects'  data, 
and  the  fourth,  d,  is  the  three  observers  average  data. 
Each  of  the  subjects  d'  and  reaction  time  measures  are  based 
on  eight  blocks  of  100  trials,  and  the  error  bars  represent 
one  standard  error  of  the  mean. 

Separate  repeated  measures  ANOVAs  were  performed  on  the 
observers'  d'  and  reaction  time  measures,  collapsed  across 
the  eight  trial  blocks.  There  was  an  effect  of  type  of 
display  arrangement  (F(5 , 10) =11. 263 ,  2  <0.001)  on  observer 
accuracy  (d's).  Subsequent  analytic  comparisons,  using  the 
pooled  variance  as  the  error  term,  indicated  that  perform- 
ance was  greater  for  element  arrangements  that  produced 
emergent  features  (F(l, 10) =23 . 2 ,  p  <0.001)  relative  to  those 
that  did  not  have  such  features.  Whether  or  not  the  emer- 
gent feature  produced  an  enclosed  contour  did  not   influence 
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Figure  22.  The  observers'  average  performance  (d')  measures 
for  the  six  arrangements  in  the  Yes/No  task.  Panels  a-c 
represent  the  individual  subjects  and  panel  d  is  the 
average  data.  The  error  bars  are  one  standard  error  of 
the  mean. 
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Figure  23.  The  observers'  average  reaction  times  measures 
(measured  from  the  offset  of  the  mask  to  response)  for 
the  six  conditions  in  the  Yes/No  task.  Panels  a-c 
represent  the  individual  subjects  and  panel  d  is  the 
average  data.  The  error  bars  are  one  standard  error  of 
the  mean. 
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performance.  However,  for  the  non-emergent  feature  dis- 
plays, performance  was  significantly  better  (F (1, 10) =30 . 8 ,  £ 
<0.001)  for  the  ANG-NL  display  relative  to  the  LIN-NL  and 
LSQ-NL  displays.  This  performance  advantage  among  the  non- 
emergent  feature  displays  may  be  a  function  of  the  underly- 
ing angular  element  code,  especially  since  the  difference 
between  the  ANG-NL  and  OBJ-NL  display  arrangements  was  not 
significant. 

The  effect  of  the  likelihood  ratio  manipulation  was 
significant  only  for  the  line-graph  displays 
(F(l, 10)=21.499,p  <0.001).  Performance  in  the  LIN-LR  condi- 
tion was  better  than  performance  given  the  other  two  linear 
displays,  LIN-NL  and  LSQ-NL.  Finally,  there  was  a  signifi- 
cant difference  among  the  observers'  reaction  time  measures 
(F(5, 10)=13.4,  p  <0.001)  for  the  separate  element  arrange- 
ments. Reaction  time  was  slower  given  a  LSQ-NL  display  code 
relative  to  the  other  element  arrangements. 
4AFC  Task 

Figures  24  and  25  depict  the  observers'  accuracy  (d') 
and  reaction  time  measures  for  the  six  display  arrangements 
in  the  4AFC  task.  Again,  three  panels,  a-c,  in  each  figure 
represent  the  individual  subjects'  data  and  the  fourth 
panel,  d,  is  the  average  for  the  three  observers.  Each  of 
the  subjects  d'  and  reaction  time  measures  are  based  on 
eight  blocks  of  100  trials,  and  the  error  bars  are  one 
standard  error  of  the  mean. 
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Figure  24.  The  observers'  average  performance  (d')  measures 
for  the  six  arrangements  in  the  4AFC  task.  Panels  a-c 
represent  the  individual  subjects  and  panel  d  is  the 
average  data.  The  error  bars  are  one  standard  error  of 
the  mean. 
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Figure  25.   The  observers'  average  reaction  times  measures 

(measured  from  the  offset  of  the  mask  to  response)  for 
the  six  conditions  in  the  4AFC  task.  Panels  a-c  repre- 
sent the  individual  subjects  and  panel  d  is  the  average 
data.  The  error  bars  are  one  standard  error  of  the  mean. 
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Again,  separate  repeated  measures  ANOVAs  were  performed 
on  the  observers'  d'  and  reaction  time  measures,  collapsed 
across  the  eight  trial  blocks.  There  was  a  significant 
effect  of  display  arrangement  (F(5 , 10) =4 . 134 ,2  <0.05)  on 
observers'  accuracy.  Subsequent  analytic  comparisons,  using 
the  pooled  variance  for  the  error  term,  indicated  that  there 
was  not  an  overall  difference  between  displays  with  and 
without  emergent  features.  However,  there  was  a  performance 
advantage  (F ( 1 , 10) =5 . 63 ,  £  <0.05)  for  display  arrangements 
that  produced  an  emergent  feature  with  an  enclosed  contour 
(e.g.,  OBJ  displays)  relative  to  an  emergent  feature  without 
this  property  (LIN-LR) .  There  was  also  a  significant 
(F(l,10)=11.75,  2  <0.01)  difference  between  the  ANG-NL  and 
the  two  line-graph  display  arrangements  that  did  not  possess 
an  emergent  feature  (LIN-NL  and  LSQ-NL) ,  though.  Given  this 
latter  difference,  these  effects  may  be  best  characterized 
in  terms  of  differences  between  line-graph  and  angular 
element  coding. 

Based  on  the  accuracy  data  alone,  observers  tended  to 
show  poorer  performance  when  emergent  features  were  present. 
The  observers  showed  the  lowest  performance  for  the  LIN-LR 
display  arrangement  and  highest  performance  for  the  ANG-NL 
display  arrangement.  In  addition,  among  the  angular  display 
codes  all  observers  showed  lowest  performance  for  the  OBJ-LR 
arrangement. 

However,  the  reaction  time  data  did  not  completely 
support  this  pattern  of  effects.   Unlike  the  accuracy  data. 
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analysis  of  reaction  times  indicated  that  reaction  time  was 
faster  (F(l, 10) =11 . 6 ,  2  ^0.01)  for  display  element  arrange- 
ments that  produced  emergent  features  than  those  that  did 
not  possess  these  features.  There  was  evidence  for  speed- 
accuracy  tradeoffs  among  the  separate  linear  and  angular 
displays.  For  instance,  among  the  linear  displays  the  LIN- 
LR  arrangement  showed  less  accuracy,  but  faster  reaction 
times  than  the  other  two  line  graph  displays.  However, 
there  remained  a  performance  advantage  for  angular  element 
coding  compared  to  the  LIN-LR  display  arrangement. 

Discussion 

There  is  a  great  deal  of  interest  from  both  theoretical 
and  practical  perspectives  in  how  human  detection  perform- 
ance is  affected  by  element  configuration.  One  of  the  main 
issues  concerns  whether  emergent  features  produced  by  se- 
lected element  arrangements  help  or  hinder  processing  of  the 
underlying  elements.  Evidence,  so  far,  suggests  that  the 
impact  on  performance  may  depend  upon  which  feature  is  most 
salient  (Pomerantz,  1981)  and  how  well  this  feature  relates 
to  the  task  being  performed  (Buttigieg  &  Sanderson,  1991)  . 

This  experiment  compared  performance  in  Yes/No  and  4AFC 
detection  tasks,  for  different  arrangements  of  the  line 
element  components  of  simple  visual  displays.  From  the 
Yes/No  task  data,  it  was  found  that  observer  accuracy  was 
affected  by  the  display  arrangement  when  observers  were 
presented  a  line-graph  display  code.  Observer  accuracy  was 
highest   when   line-graph  display  arrangements   produced   an 
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emergent  feature  (LIN-LR) ,  and  performance  in  this  condition 
was  indistinguishable  from  the  angular  display  code  arrange- 
ments which  consistently  produced  superior  performance. 

Alternatively,  the  same  feature  appeared  to  hinder 
performance  in  the  4AFC  task,  which  required  focused  atten- 
tion to  the  separate  elements.  That  is,  accuracy  was  rela- 
tively low  and  reaction  time  was  higher  for  the  line-graph 
display  arrangement  which  possessed  an  emergent  feature. 
Similarly,  there  was  a  tendency  for  poorer  performance  given 
an  emergent  angular  display,  OBJ-LR,  relative  to  an  non- 
emergent  angular  display,  ANG-NL.  This  pattern  of  effects 
would  be  expected  if  attention  is  automatically  directed 
toward  the  emergent  feature,  and  additional  processing 
capacity  has  to  be  invoked  to  gather  information  from  the 
underlying  elements  (Navon,  1977;  Pomerantz  &  Schwaitzberg, 
1975)  . 

Although  there  is  no  strong  evidence  suggesting  an 
effect  of  the  relationship  between  the  emergent  feature  and 
the  decision  statistic,  it  is  not  possible  to  completely 
rule  out  this  factor.  For  instance,  in  the  Yes/No  task  the 
emergent  feature  advantage  observed  for  the  LIN-LR  arrange- 
ment relative  to  the  other  line  graph  displays  could  also  be 
explained  as  a  difference  due  to  the  relationship  between 
the  emergent  feature  and  the  decision  statistic.  This  is 
true  because  the  LIN-LR  arrangement  possessed  both  factors, 
where  the  other  two  displays  possessed  neither  of  these 
factors.   It  was  only  by  comparing  performance  across   tasks 
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that  we  could  draw  conclusions  about  which  factors  were 
influencing  performance.  Furthermore,  some  caution  should 
be  exercised  when  interpreting  the  results  since  the  poor 
performance  observed  in  the  LIN-LR  display  in  the  4AFC  task 
may  be  related  to  masking  effects  of  processing  nearby  items 
in  the  visual  field  (Eriksen  &  Eriksen,  1974)  . 

Wickens  and  others  have  argued  that  this  pattern  of 
effects  may  be  explained  in  terms  of  the  proximity  compati- 
bility principle  (Andre  &  Wickens,  1988;  Carswell  &  Wickens, 
1987;  Wickens,  1992).  According  to  the  proximity  compati- 
bility principle,  tasks  that  require  integration  are  better 
supported  by  display  arrangements  which  have  high  perceptual 
proximity;  whereas,  tasks  which  require  focused  attention 
are  better  supported  by  arrangements  which  have  low  percep- 
tual proximity.  If  there  were  more  distinguishable  differ- 
ences among  the  angular  element  display  arrangements  which 
produced  consistently  high  performance,  it  may  be  possible 
to  further  eliminate  alternative  hypotheses. 

This  raises  another  question.  Why  was  performance  so 
good  for  displays  composed  of  angular  elements?  One  expla- 
nation for  this  angular  display  code  advantage  is  that  it 
is  easier  to  extract  magnitude  information  when  it  is  coded 
as  changes  in  the  size  of  an  angle  than  when  it  is  coded  as 
the  length  of  a  line  segment.  That  is,  the  angle  may  empha- 
size the  scale  of  the  underlying  element  magnitude.  Alter- 
natively, observers  may  have  used  the  direction  of  the  angle 
rather   than  the  size  of  the  angle  or  the  area   enclosed   by 
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the  angle  to  make  their  decisions.  For  instance,  observers 
may  have  identified  angles  pointing  toward  the  center  of  the 
display  (OBJ-LR)  or  downward  (OBJ-NL  or  ANG-NL)  as  repre- 
senting "noise,"  and  the  opposite  as  indicating  "signal" 
events.  Thus,  their  decisions  would  have  been  based  on 
binary  information  rather  than  the  actual  magnitudes  of  the 
underlying  elements. 

A  simple  test  of  the  latter  possibility  would  be  to 
conduct  a  study  in  which  the  tasks  and  displays  are  identi- 
cal to  those  used  in  the  preceding  study.  However,  the 
distribution  parameters  would  be  manipulated  so  that  in  one 
case  observers  could  use  the  direction  of  the  angle  and  in 
the  other  cases  they  could  not.  For  instance,  if  the  under- 
lying signal  and  noise  distributions  had  relatively  small  or 
large  means  then  most  of  the  angles  would  be  either  small  or 
large,  respectively.  Thus,  the  direction  of  the  angles 
would  be  less  useful  in  forming  a  detection  decision.  If 
observers  are  using  the  magnitude  of  the  angles,  there 
should  be  no  differences  among  the  selected  pairs  of  means, 
as  long  as  the  distance  between  the  distribution  means  and 
the  standard  deviations  were  held  constant. 


GENERAL  CONCLUSIONS 

We  investigated  whether  selected  display  coding  factors 
would  assist  observers  in  visual  signal  detection.  The 
factors  investigated  were  partially  selected  based  on  knowl- 
edge of  the  predicted  optimal  observer  as  defined  in  the  TSD 
paradigm  (Green  &  Swets,  1966) .  The  evidence  from  these 
studies  suggests  potential  means  for  coding  displays  that 
will  assist  observers  in  forming  detection  decisions. 

In  the  first  study,  observers  performed  a  Yes/No  detec- 
tion task  where  the  separate  sources  differed  in  reliabili- 
ty. The  main  concern  was  to  determine  whether  observers 
included  these  differences  in  source  reliability  in  their 
detection  decisions.  Evidence  from  research  on  human  deci- 
sion making  suggests  that  when  informational  sources  differ 
in  inf ormativeness,  decision  makers  generally  do  not  consid- 
er these  differences  in  forming  their  decisions.  Instead, 
they  acts  as  though  the  sources  are  equally  informative 
(Wickens,  1984),  and  weight  them  accordingly.  Berg  (1990), 
also,  found  that  observers  were  better  at  weighting  sources 
equal  rather  than  unequal  in  reliability  in  an  auditory 
frequency  discrimination  task. 

The  results  from  the  first  study  were  consistent  with 
prior  evidence;  observers  were  generally  better  at  weighting 
sources  equal  rather  than  unequal  in  reliability.    However, 
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when  sources  high  in  reliability  were  cued  by  gauge  lumi- 
nance, weighting  efficiency  was  equivalent  to  the  equal 
reliability  condition.  Furthermore,  the  best  performance 
and  highest  weighting  efficiency  occurred  when  sources  high 
in  reliability  were  cued  by  gauge  luminance,  presented  at 
long  stimulus  durations,  and  contiguous  rather  than  distrib- 
uted throughout  the  visual  field.  The  performance  advantage 
associated  with  grouped  source  reliabilities  is  consistent 
with  the  results  of  Posner  et  al.  (1980)  . 

Examination  of  the  observers'  weights  indicate  that 
observers  tended  to  use  a  relatively  equal  weighting  strate- 
gy when  there  was  uncertainty  about  the  location  of  the  more 
reliable  sources.  This  may  partly  explain  the  performance 
advantage  for  the  odd  arrangement  in  the  non-cued  condi- 
tions, since  this  arrangement  was  most  similar  to  an  equal 
reliability  pattern.  However,  based  on  the  current  evidence 
it  is  not  clear  what  factor  accounts  for  this  advantage,  and 
why  it  is  not  maintained  in  the  cued  condition.  A  future 
study  which  examines  other  factors  that  may  contribute  to 
this  advantage  may  help  to  understand  these  effects.  For 
instance,  is  it  the  size  of  the  difference  between  sources 
high  and  low  in  reliability  or  the  distribution  of  sources 
reliabilities  which  produce  this  advantage? 

Despite  the  relatively  equal  weighting  pattern  used  in 
the  non-cued  conditions,  observers  showed  moderate  sensitiv- 
ity to  differences  in  source  reliability.  This  was  true 
even  when  observers  could  not  use  the  trial-to-trial   source 
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variability  to  identify  which  were  more  reliable  (e.g.  the 
UNcM  condition) .  This  may  be  related  to  the  consistency  in 
the  vertical  displacements  of  the  gauge  markers  for  the  high 
reliability  sources.  The  fairly  straight  line  produced  by 
the  gauge  markers  common  vertical  positions  in  the  visual 
field  may  have  engaged  observers  attention,  helping  them  to 
identify  which  sources  were  more  reliable.  Thus,  an  "emer- 
gent" pattern  produced  by  the  gauge  marker  arrangement  may 
have  assisted  observers  in  this  task. 

Whether  or  not  such  "emergent"  features  assist  observers 
in  making  detection  decisions  was  addressed  in  the  second 
study.  The  second  study  examined  the  effects  of  two  factors 
related  to  display  element  arrangement  on  observers'  detec- 
tion decisions  for  both  Yes/No  and  4AFC  detection  tasks. 
The  first  factor  was  whether  or  not  the  display  element 
arrangement  produced  an  emergent  feature.  That  is,  a  fea- 
ture which  is  produced  by  the  configuration  of  the  underly- 
ing elements,  but  not  present  in  any  given  element  (Treis- 
man,  1986) .  The  second  factor  was  whether  or  not  the  emer- 
gent feature  had  some  property  (e.g.  size  or  area)  that  was 
directly  related  to  the  optimal  decision  statistic  for  the 
Yes/No  detection  task.  Based  on  current  evidence,  many 
argue  that  the  important  factor  is  the  relationship  between 
the  task  and  the  display  code  (Bennett  &  Flach,  1992;  Ben- 
net,  Toms,  &  Woods,  1993;  Schmidt  &  Elvers,  1992;  Sanderson 
et  al. ,  1989)  . 
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Arranging  the  line  graph  displays  to  produce  an  emergent 
feature,  improved  Yes/No  performance  and  impaired  4AFC  per 
formance.  However,  it  was  not  possible  to  completely  rule 
out  the  effects  of  the  coded  relationship  between  the  opti- 
mal detection  statistic  and  some  property  of  the  emergent 
feature.  The  angular  display  arrangements  consistently 
produced  high  performance,  and  there  were  no  distinguishable 
differences  among  the  separate  angular  arrangements,  for 
either  task. 

This  angular  advantage  may  be  a  function  of  at  least  two 
possible  factors.  First  of  all,  it  may  be  easier  to  extract 
magnitude  information  when  it  is  coded  as  changes  in  the 
size  of  an  angle  than  when  it  is  coded  as  the  length  of  a 
line  segment.  That  is,  the  angle  may  emphasize  the  scale  of 
the  underlying  element  magnitude.  Alternatively,  the  ob- 
servers may  have  been  able  to  use  the  direction  of  the  angle 
to  make  their  decisions  about  the  underlying  state  of  the 
system.  A  future  study  may  provide  some  insights  into 
whether  or  not  these  factors  where,  in  fact,  producing  this 
performance  advantage.  Bennett,  Toms,  and  Woods  (1993) 
point  out  that  emphasizing  the  scale  of  the  underlying 
element  magnitude  helps  observers  to  process  the  underlying 
elements.  This  is  especially  important  when  attention  has 
to  be  focused  on  elements  which  are  arranged  to  produce  an 
emergent  feature. 

Thus,  by  defining  the  optimal  observer  we  can  (1)   iden- 
tify how  well  humans  perform  relative  to   the   theoretical 
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ideal,  and  (2)  identify  means  of  aiding  performance  based  on 
what  we  discover  is  causing  inferior  performance.  For 
instance,  introducing  a  luminance  cue  helps  observers  to 
prioritize  sources  according  to  their  informativeness. 
Furthermore,  using  an  angular  display  code  in  visual  signal 
detection  tasks  can  produce  nearly  ideal  performance  in  both 
Yes/No  and  4AFC  detection  tasks.  Otherwise,  designers 
should  attempt  to  create  display  codes  which  possess  "well- 
mapped"  emergent  features. 
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APPENDIX  A 
YES/NO  DECISION  STATISTIC 

In  a  Yes/No  detection  task  an  observer  is  presented  n 
independent  sources  of  information,  x^,  Xj,  .  .  .  ,  x^.  On  a 
given  trial,  each  element  represents  one  of  two  alternatives 
(signal  or  noise) .  On  signal  trials,  all  x.  are  sampled 
from  a  distribution  with  the  parameters  ^  •  and  a  ..  Noise 
trials  are  sampled  from  a  distribution  with  parameters  /i  . 
and  a  . ,    where  a  .  =  a  .  -   o  . ,    and  u  ■  <  u  . . 

The  likelihood  that  this  evidence  represents  a  signal  is 

related  to  the  probability  of  their  joint  occurrence, 

L(x^,X2,  .  .  .  ,xj  =  L(x,)*L(X2)  .  .  .*L(X3)  .  (Al) 

The  likelihood  ratio  for  a  given  source,  x.,  is  the  ratio  of 

the  conditional  probabilities  for  that  source.   That  is, 

f(x./s)    [l/(2no^^)'''^]EXP[-h({x.-^M^.)/a^.)^  ] 

L(x.)  =    =   ; (A2) 

f  (x./n)    [l/(27ra^p^^]EXP[-M(x,.-M„)/a^j)^  ]. 

Once   the  equation  is  reduced,  based  on  the  general  laws   of 

exponents  we  have  the  following: 

L(x.)  =  EXP[-!5(((x,.-M,i)/a^,.)^  -  ((X;-M„)/a^,.)M]-        (A3) 

By  taking  the  natural  logarithm  of  the   likelihood   ratio, 

and,  again  reducing  the  equation  we  have  the  following: 

lnL(x.)  =  x.(  (M  .-/i  .)/a  ?  )  -  W  (/i  ^  -M  '  )/o  '  )  .         (A4) 

Then,   for  the  combined  evidence,  l(x^,X2,X3)  it  turns   out 

that  a  weighted  sum  of  the  evidence,  x^,  is  directly  related 

to  the  likelihood  ratio, 

n 

lnL(x^,X2,  .  .  .  ,XJ=S(  (M,,.-M^,.)/a^nx.  -h{{^^]  -M,^  )/o^^  )  .  (A5) 

1 
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APPENDIX  B 
BERG'S  THEORETICAL  SOLUTION  OF  THE  WEIGHTS 

Appendix   A  showed  that  the  optimal   decision   statistic 

for  a  Yes/No  task  is  a  weighted  sum  of  the  evidence  (Green  & 

Swets,    1966).     Berg's    Conditional-On-A-Single-Stimulus 

(COSS)  technique  is  based  on  this  assumption.   That  is,   the 

observer   responds  "signal"  when  the  evidence,  x^ ,   weighted 

by   some  arbitrary  number,  a^,   surpasses   some   criterion 

value,  D, 

n 

Ha.x.  >   D,  (Bl) 

1 

otherwise  the  observer  responds  "noise."   In  the  theoretical 

solution  of  the  individual  weights  Berg  begins  by   isolating 

a   single   element,   x^ ,  on  the  left  side   of   equation   Bl, 

producing  the  following  inequality: 

n 

Xj  >  D  -[Sa^Xj.  /  a^],      given  i  /  j.         (B2) 
J 

Then,  a  new  variable,  Y,,  is  substituted  for  the  right   side 
of  the  inequality.   Y.  is  also  normally  distributed  since  it 
represents   the   sum  of  independent,   normally  distributed, 
random  variables,  x^,  and  has  the  parameters: 

n 

E[Yj]  =  D  -  [Sa.x.  /  a^] 

(B3) 

n 

Var[Y.]  =  Sa^  a  ?  /  a?  . 

J 

Given  a  signal  trial  and  a  particular  source,  x^ ,  the  proba- 
bility of  saying  "signal"  given  Xj  is 
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P(S|x,.,s)  = 


X 

f(u|s)du,  (B4) 

—00 


where    f(x|s)   is  a  normal  density  function  with  a  mean 
E[yJs]   and   a  variance  =  VAR[Yj|s].    A   similar  density 
function  exits  for  noise  trials. 

Berg  uses  the  sum  of  the  variance  of  the  normal  density, 
VAR[Yj],  and  the  variance  of  the  original  x^ ,  o^^.  ,    to  derive 
the  estimated  weights.   One  element,  in  this  case  x.,  is  set 
to   unity   to  solve  for  the  remaining  weights   (k=l..m)   as 
shown  below. 


VARFY.]  +   a  '  Ea?  a  ?  /  a?     a^ 

'•  J  •■      ej         J  ej  '    1       k 


(B5) 


n 

VAR[Y,]  +  a^     Sa?a^  /  a^     a? 

'-  k  J      ek         1   ei   '    k        j 
1 


In  the  actual  derivation  of  the  weights,  we  know  the  values 
of  a  ^  .   We  need  to  find  the  values  of  VAR[Y.]  given  the  two 

el  '-  1  ■*   -' 

types  of  trials  (signal  and  noise) ,  and  we  need  to  specify 
which  element  is  set  to  unity.  The  values  VAR[Y.]  are 
estimated  by  finding  the  variances  of  the  cumulative  nor- 
mals with  the  best  Chi-square  fits  to  the  observer's  COSS 
functions.  Selection  of  the  element  to  be  set  to  unity  is 
arbitrary;  the  center  element,  x^,  was  selected  in  this 
study. 


^2,Xj,x^.    On  a  given  trial,  three  sources  represent   noise 


APPENDIX  C 
4AFC  DECISION  STATISTIC 

In  a  Four-Alternative-Forced-Choice  task   an  observer  is 

presented   four   independent   sources   of   information,   x^ , 

and  their  values  are  drawn  from  a  normal  distribution  with  a 
mean,  u  ,  and  a  standard  deviation,  a  .  The  value  of  the 
remaining  source  is  drawn  from  a  distribution  with  a  mean, 
u  and  a  standard  deviation,  o  ,  where  a  <  a  ,  and  a  =  a 
=  a  .  The  observer  has  to  decide  which  source  represents 
the  signal  event.  That  is,  in  a  4AFC  task,  the  observer  has 
to  decide  which  of  the  following  alternatives  is  true: 
<s,n,n,n>,  <n,s,n,n>,  <n,n,s,n>,  or  <n,n,n,s>. 

To   simplify  the  derivation  of  the   optimal   decision 
statistic.   Green  (1992)  considers  the  four  alternatives   as 
representing   four  possible  signals  (for  instance   <s,n,n,n> 
would   egual   Sg^)   which  are   compared  with   noise   alone, 
<n,n,n,n>.   That  is,  if  x  =  <x^ ,  x^,    Xj,  x^>  then 

f(x/s)f(x2/n)f(X3/n)f(xyn) 

l(x|Sg,.)  =    (CI) 

f  (x/n)  f  (x^/n)  f  (Xj/n)  f  (x,/n)  . 

Once  we  substitute  the  definition  for  the  conditional  proba- 
bilities  and  generalize  the  equation  for  all  possible   Sg. , 
equation  CI  becomes 
m 


M 


EXP[-!5((x.-M,)/aJ^  ]EXP[-i5((x,.-M3)/a^)^  ] 


l(x|Sg,)=    -j ^— ^ ^^— ^ (C2) 

m 

M  TEXP[-!5((x.-M,)/aJ^  ]EXP[-?5((x,.-M,)/aJ^  ], 
J 
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where   j   ^  i,  M  =  ( 1/ (27ra^  )  "'''^)  ,  and  m  =  4.   Once  we   reduce 
the  equation  we  have  the  following: 

l(x|Sg.)  =  EXP[Xj((M3-M,)/a^)  -  h{{nl  -M^  /a^  )  ]  •  (C3) 
Thus,  the  optimal  decision  statistic  is  to  choose  the  larg- 
est value,  Xj,  since  it  is  directly  related  to  the  largest 
likelihood  ratio. 


APPENDIX  D 
RELATIONSHIP  BETWEEN  YES/NO  d'  AND  mAFC  PERCENT  CORRECT 

Green  (1992)  shows  that  the  integral  in  equation  19   can 
be  expressed  in  terms  of  a  Yes/No  ROC  since  the  same  proba- 
bility density  functions,  f(x|s)  and  f(x|n),  can  be  used  to 
define  the  hit  and  false  alarm  probabilities.   If  we  let  a 
specific  signal  value,  x^,  equal  u  the  false  alarm  probabil- 
ity is  equal  to 


P„(S|n)  = 


00 


f (x I n) dx, 


(Dl) 


u 


and  the  hit  probability  is  equal  to 


P,(S|s)  = 


00 


f (x I s) dx. 


(D2) 


u 


The  complements  of  these  values  are 


1  -  P  (S  n) 


1  -  P„(S|s) 


f(x|n)dx,  and 

-00 


u 


f(x|s)dx,  respectively. 


(D3) 


(D4) 


-co 


Taking  the  derivative  of  equation  D4 ,  we  have  the  following, 

-dPJs|s)  =  f(u|s)du  (D5) 

Then  by  substituting  equations  D3  and  D5  into  equation   19. 
Green  (1992)  produces  the  following  equation: 


P2(C) 


00 


[1-P„  (S|n)]-dP.(s|s) 


(D6) 


-00 


When  the  criterion  value  is  low,  u  is  -oo,  then  the  hit 
probability  will  be  high,  P(S/s)  =  1.  Similarly,  when  the 
criterion  value  is  high,  u  =  oo,  then  the  hit  probability   is 
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low,  P(S/s)  =  0.  So  the  limits  of  integration  can  be 
replaced  by  these  values,  and  their  order  is  switched  by 
changing  the  sign  of  the  integral,  leading  to  the  following 
equation: 

'1 


P,(C) 


[1-P^  (s|n)]dP^(s|s)  (D7) 

0 


where   the  right  side  of  this  equation  is  equal  to  the   area 
under  a  Yes/No  ROC. 

Green  (1992),  then  shows  that  equation  D7  can  be  rewrit- 
ten for  m  alternatives,  where  m  >  2.  For  an  m-alternative 
forced-choice  task,  as  the  one  used  in  the  second  study, 
there  are  m-1  noise  samples.  The  probability  of  a  correct 
response  in  this  case  is  equal  to  the  probability  that  the 
signal  sample,  x^  (or  in  this  discussion  u) ,  is  greater 
than  all  m-1  noise  samples.  Thus,  equation  D7  is  rewritten 
to  account  for  the  m-1  noise  samples  giving  us  the  following 
equation: 

'1 
PJc)  =     [1-P^  (S|n)]"'-Mp^  (S|s).  (D8) 

0 
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